A SYSTEM FOR CELL-BASED SCREENING 

Cross Reference 

This application claims priority to U.S. Provisional Applications for Patent 
Serial Nos. 60/122,152 (February 26, 1999), 60/123,399 (March «, 1999), 09/352,141, 
(July 12, 1999), 60/151,797 (August 31, 1999), 60/168,408 (December 1, 1999); and is 
a continuation in part of 09/430,656 (October 29, 1999); 09/398,965 filed September 
17, 1999 which is a continuation in part of Serial No. 09/031,271 filed February 27, 
1998 which is a continuation in part of U.S. Application S/N 08/810983, filed on 
February 27, 1997. 

Field of The Invention 

This invention is in the field of fluorescence-based cell and molecular 
biochemical assays for drug discovery. 

Background of the Invention 

Drug discovery, as currently practiced in the art, is a long, multiple step process 
involving identification of specific disease targets, development of an assay based on a 
specific target, validation of the assay, optimization and automation of the assay to 
produce a screen, high throughput screening of compound libraries using the assay to 
identify "hits", hit validation and hit compound optimization. The output of this 
process is a lead compound that goes into pre-clinical and, if validated, eventually into 
clinical trials. In this process, the screening phase is distinct from the assay 
development phases, and involves testing compound efficacy in living biological 
systems. 

Historically, drug discovery is a slow and costly process, spanning numerous 
years and consuming hundreds of millions of dollars per drug created. Developments 
in the areas of genomics and high throughput screening have resulted in increased 
capacity and efficiency in the areas of target identification and volume of compounds 
role in the identification of potential new targets. Proteomics has become indispensable 
in relating structure and function of protein targets in order to predict drug interactions. 
However, the next level of biological complexity is the cell. Therefore, there is a need 
to acquire, manage and search multi-dimensional information from cells. Secondly, 
there is a need for higher throughput tools. Automation is a key to improving 
productivity as has already been demonstrated in DNA sequencing and high throughput 
primary screening. The instant invention provides for automated systems that extract 
multiple parameter information from cells that meet the need for higher throughput 
tools. The instant invention also provides for miniaturizing the methods, thereby 
allowing increased throughput, while decreasing the volumes of reagents and test 
compounds required in each assay. 

Radioactivity has been the dominant read-out in early drug discovery assays. 
However, the need for more information, higher throughput and miniaturization has 
caused a shift towards using fluorescence detection. Fluorescence-based reagents can 
yield more powerful, multiple parameter assays that are higher in throughput and 
information content and require lower volumes of reagents and test compounds. 
Fluorescence is also safer and less expensive than radioactivity-based methods. 

Screening of cells treated with dyes and fluorescent reagents is well known in 
the art. There is a considerable body of literature related to genetic engineering of cells 
to produce fluorescent proteins, such as modified green fluorescent protein (GFP), as a 
reporter molecule. Some properties of wild-type GFP are disclosed by Morise et al. 
(Biochemistry 13 (1974), p. 2656-2662), and Ward et al. (Photochem. Photobiol. 31 
(1980), p. 611-615). The GFP of the jellyfish Aequorea victoria has an excitation 
maximum at 395 nm and an emission maximum at 510 nm, and does not require an 
exogenous factor for fluorescence activity. Uses for GFP disclosed in the literature are 
widespread and include the study of gene expression and protein localization (Chalfie 
et al., Science 263 (1994), p. 12501-12504), as a tool for visualizing subcellular 
organelles (Rizzuto et al., Curr. Biology 5 (1995), p. 635-642), visualization of protein 
transport along the secretory pathway (Kaether and Gerdes, FEBS Letters 369 (1995), 
p. 267-271), expression in plant cells (Hu and Cheng, FEBS Letters 369 (1995), p. 
331-334) and Drosophila embryos (Davis et al., Dev. Biology 170 (1995), p. 726-729), 
and as a reporter molecule fused to another protein of interest (U. S. Patent 
calculate the total fluorescence per well for cell-based assays. Fluid delivery devices 
have also been incorporated into cell based screening systems, such as the FLIPR 
system, in order to initiate a response, which is then observed as a whole well 
population average response using a macro-imaging system. 

In contrast to high throughput screens, various high-content screens ("HCS") 
have been developed to address the need for more detailed information about the 
temporal-spatial dynamics of cell constituents and processes. High-content screens 
automate the extraction of multicolor fluorescence information derived from specific 
fluorescence-based reagents incorporated into cells (Giuliano and Taylor (1995), Curr. 
Op. Cell Biol. 7:4; Giuliano et al. (1995) Ann. Rev. Biophys. Biomol. Struct. 24:405). 
Cells are analyzed using an optical system that can measure spatial, as well as temporal 
dynamics. (Farkas et al. (1993) Ann. Rev. Physiol. 55:785; Giuliano et al. (1990) In 
Optical Microscopy for Biology. B. Herman and K. Jacobson (eds.), pp. 543-557. 
Wiley-Liss, New York; Hahn et al (1992) Nature 359:736; Waggoner et al. (1996) 
Hum. Pathol. 27:494). The concept is to treat each cell as a "well" that has spatial and 
temporal information on the activities of the labeled constituents. 

The types of biochemical and molecular information now accessible through 
fluorescence-based reagents applied to cells include ion concentrations, membrane 
potential, specific translocations, enzyme activities, gene expression, as well as the 
presence, amounts and patterns of metabolites, proteins, lipids, carbohydrates, and 
nucleic acid sequences (DeBiasio et al., (1996) Mol. Biol. Cell. 7:1259; Giuliano et al. 
(1995) Ann. Rev. Biophys. Biomol. Struct. 24:405; Heim and Tsien, (1996) Curr Biol. 
6:178). 

High-content screens can be performed on either fixed cells, using fluorescently 
labeled antibodies, biological ligands, and/or nucleic acid hybridization probes, or live 
cells using multicolor fluorescent indicators and "biosensors." The choice of fixed or 
live cell screens depends on the specific cell-based assay required. 

Fixed cell assays are the simplest, since an array of initially living cells in a 
microtiter plate format can be treated with various compounds and doses being tested, 
then the cells can be fixed, labeled with specific reagents, and measured. No 
environmental control of the cells is required after fixation. Spatial information is 
acquired, but only at one time point. The availability of thousands of antibodies, 
248:73; Gratton et al., (1994) Proc. of the Microscopical Society of America, pp. 154- 
155) are also well established methods for acquiring high resolution images of 
microscopic samples. The principal advantage of these optical systems is the very 
shallow depth of focus, which allows features of limited axial extent to be resolved 
against the background. For example, it is possible to resolve internal cytoplasmic 
features of adherent cells from the features on the cell surface. Because scanning 
multiphoton imaging requires very short duration pulsed laser systems to achieve the 
high photon flux required, fluorescence lifetimes can also be measured in these systems 
(Lakowicz et al., (1992) Anal. Biochem. 202:316-330; Gerrittsen et al. (1997), J. of 
Fluorescence 7:11-15), providing additional capability for different detection modes. 
Small, reliable and relatively inexpensive laser systems, such as laser diode pumped 
lasers, are now available to allow multiphoton confocal microscopy to be applied in a 
fairly routine fashion. 

A combination of the biological heterogeneity of cells in populations (Bright et 
al. (1989) J. Cell Physiol. 141:410; Giuliano (1996) Cell Motil. Cytoskel. 35:237), as 
well as the high spatial and temporal frequency of chemical and molecular information 
present within cells, makes it impossible to extract high-content information from 
populations of cells using existing whole microtiter plate readers. No existing 
high-content screening platform has been designed for multicolor, fluorescence-based 
screens using cells that are analyzed individually. Similarly, no method is currently 
available that combines automated fluid delivery to arrays of cells for the purpose of 
systematically screening compounds for the ability to induce a cellular response that is 
identified by HCS analysis, especially from cells grown in microtiter plates. 
Furthermore, no method exists in the art combining high throughput well-by-well 
measurements to identify "hits" in one assay followed by a second high content cell-by- 
cell measurement on the same plate of only those wells identified as hits. 

The instant invention provides systems, methods, and screens that combine high 
throughput screening (HTS) and high content screening (HCS) that significantly 
improve target validation and candidate optimization by combining many cell screening 
formats with fluorescence-based molecular reagents and computer-based feature 
extraction, data analysis, and automation, resulting in increased quantity and speed of 
• an XY stage adapted for holding a plate containing an array of cells and 
having a means for moving the plate for proper alignment and focusing on 
the cell arrays; 

• a digital camera; 

• a light source having optical means for directing excitation light to cell 
arrays and a means for directing fluorescent light emitted from the cells to 
the digital camera; and 

• a computer means for receiving and processing digital data from the digital 
camera wherein the computer means includes a digital frame grabber for 
receiving the images from the camera, a display for user interaction and 
display of assay results, digital storage media for data storage and archiving, 
and a means for control, acquisition, processing and display of results. 

In a preferred embodiment, the cell screening system further comprises a 
computer screen operatively associated with the computer for displaying data. In 
another preferred embodiment, the computer means for receiving and processing digital 
data from the digital camera stores the data in a bioinformatics data base. In a further 
preferred embodiment, the cell screening system further comprises a reader that 
measures a signal from many or all the wells in parallel. In another preferred 
embodiment, the cell screening system further comprises a mechanical-optical means 
for changing the magnification of the system, to allow changing modes between high 
throughput and high content screening. In another preferred embodiment, the cell 
screening system further comprises a chamber and control system to maintain the 
temperature, CO2 concentration and humidity surrounding the plate at levels required to 
keep cells alive. In a further preferred embodiment, the cell screening system utilizes a 
confocal scanning illumination and detection system. 

In another aspect of the present invention, a machine readable storage medium 
comprising a program containing a set of instructions for causing a cell screening 
system to execute procedures for defining the distribution and activity of specific 
cellular constituents and processes is provided. In a preferred embodiment, the cell 
screening system comprises a high magnification fluorescence optical system with a 
stage adapted for holding cells and a means for moving the stage, a digital camera, a 
wherein the first domain and the third domain are separated by the second 
domain. 

In a further aspect, the present invention involves assays and reagents for 
characterizing a sample for the presence of a toxin. The method comprises the use of 
detector, classifier, and identifier classes of toxin biosensors to provide for various 
levels of toxin characterization. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a diagram of the components of the cell-based scanning system. 
Figure 2 shows a schematic of the microscope subassembly. 
Figure 3 shows the camera subassembly. 
Figure 4 illustrates cell scanning system process. 

Figure 5 illustrates a user interface showing major functions to guide the user. 
Figure 6 is a block diagram of the two platform architecture of the Dual Mode System 
for Cell Based Screening in which one platform uses a telescope lens to read all wells 
of a microtiter plate and a second platform that uses a higher magnification lens to read 
individual cells in a well. 

Figure 7 is a detail of an optical system for a single platform architecture of the Dual 
Mode System for Cell Based Screening that uses a moveable 'telescope' lens to read all 
wells of a microtiter plate and a moveable higher magnification lens to read individual 
cells in a well. 

Figure 8 is an illustration of the fluid delivery system for acquiring kinetic data on the 
Cell Based Screening System. 

Figure 9 is a flow chart of processing steps for the cell-based scanning system. 
Figure 10 A-J illustrates the strategy of the Nuclear Translocation Assay. 
Figure 11 is a flow chart defining the processing steps in the Dual Mode System for 
Cell Based Screening combining high throughput and high content screening of 
microtiter plates. 

Figure 12 is a flow chart defining the processing steps in the High Throughput mode of 
the System for Cell Based Screening. 

Figure 13 is a flow chart defining the processing steps in the High Content mode of the 
System for Cell Based Screening. 

changes in f-actin content were highly variable and not significant. Cells were exposed 
to the compounds for 30 hours. 

Figure 28. Graphs depicting mitochondrial changes in response to induction of 
apoptosis. L929 (A,B) and BHK (C,D) cells responded to both staurosporine (A,C) and 
paclitaxel (B,D) with increases in mitochondrial mass. MCF-7 cells exhibit either a 
decrease in membrane potential (E, staurosporine) or an increase in mitochondrial mass 
(F, paclitaxel) depending on the stimulus. Cells were exposed to the compounds for 30 
hours. 28G is a graph showing the simultaneous measurement of staurosporine effects 
on mitochondrial mass and mitochondrial potential in BHK cells. 

Figure 29 shows the nucleic acid and amino acid sequence for various types of 
protease biosensor domains. (A) Signal sequences. (B) Protease recognition sites. (C) 
Product/Reactant target sequences. 

Figure 30 schematically shows some basic organization of domains in the 
protease biosensors of the invention. 
Figure 31 is a schematic diagram of a specific 3-domain protease biosensor. 

Figure 32 is a photograph showing the effect of stimulation of apoptosis by cis-platin 
on BHK cells transfected with an expression vector that expresses the caspase 
biosensor shown in Figure 32. 

Figure 33 is a schematic diagram of a specific 4-domain protease biosensor. 
Figure 34 is a schematic diagram of a specific 4-domain protease biosensor, containing 
a nucleolar localization signal. 

Figure 35 is a schematic diagram of a specific 5-domain protease biosensor. 
Figure 36 shows the differential response in a dual labeling assay of the p38 MAPK 
and NF-kB pathways across three model toxins and two different cell types. 
Treatments marked with an asterisk are different from controls at a 99% confidence 
level (p < 0.01). 

DETAILED DESCRIPTION OF THE INVENTION 

All cited patents, patent applications and other references are hereby 
incorporated by reference in their entirety. 

As used herein, the following terms have the specified meaning: 
High content screening (HCS) can be used to measure the effects of drugs on 
complex molecular events such as signal transduction pathways, as well as cell 
functions including, but not limited to, apoptosis, cell division, cell adhesion, 
locomotion, exocytosis, and cell-cell communication. Multicolor fluorescence permits 
multiple targets and cell processes to be assayed in a single screen. Cross-correlation 
of cellular responses will yield a wealth of information required for target validation 
and lead optimization. 

In one aspect of the present invention, a cell screening system is provided 
comprising a high magnification fluorescence optical system having a microscope 
objective, an XY stage adapted for holding a plate with an array of locations for 
holding cells and having a means for moving the plate to align the locations with the 
microscope objective and a means for moving the plate in the direction to effect 
focusing; a digital camera; a light source having optical means for directing excitation 
light to cells in the array of locations and a means for directing fluorescent light emitted 
from the cells to the digital camera; and a computer means for receiving and processing 
digital data from the digital camera wherein the computer means includes: a digital 
frame grabber for receiving the images from the camera, a display for user interaction 
and display of assay results, digital storage media for data storage and archiving, and 
means for control, acquisition, processing and display of results. 

Figure 1 is a schematic diagram of a preferred embodiment of the cell scanning 
system. An inverted fluorescence microscope 1 is used, such as a Zeiss Axiovert 
inverted fluorescence microscope which uses standard objectives with magnification of 
1-100x to the camera, and a white light source (e.g. 100W mercury-arc lamp or 75W 
xenon lamp) with power supply 2. There is an XY stage 3 to move the plate 4 in the 
XY direction over the microscope objective. A Z-axis focus drive 5 moves the 
objective in the Z direction for focusing. A joystick 6 provides for manual movement 
of the stage in the XYZ direction. A high resolution digital camera 7 acquires images 
from each well or location on the plate. There is a camera power supply 8, an 
automation controller 9 and a central processing unit 10. The PC 11 provides a display 
12 and has associated software. The printer 13 provides for printing of a hard copy 
record. 
layers. The large depth of field of wide field microscopes produces an image that is a 
projection through the many layers of cells, making analysis of subcellular spatial 
distributions extremely difficult in layer-forming cells. Alternatively, the very shallow 
depth of field that can be achieved on a confocal microscope (about one micron) 
allows discrimination of a single cell layer at high resolution, simplifying the 
determination of the subcellular spatial distribution. Similarly, confocal imaging is 
preferable when detection modes such as fluorescence lifetime imaging are required. 

The output of a standard confocal imaging attachment for a microscope is a 
digital image that can be converted to the same format as the images produced by the 
other cell screening system embodiments described above, and can therefore be 
processed in exactly the same way as those images. The overall control, acquisition 
and analysis in this embodiment is essentially the same. The optical configuration of 
the confocal microscope system is essentially the same as that described above, except 
for the illuminator and detectors. Illumination and detection systems required for 
confocal microscopy have been designed as accessories to be attached to standard 
microscope optical systems such as that of the present invention (Zeiss, Germany). 
These alternative optical systems therefore can be easily integrated into the system as 
described above. 

Figure 4 illustrates an alternative embodiment of the invention in which cell 
arrays are in microwells 40 on a microplate 41, described in co-pending U.S. 
Application S/N 08/865,341, incorporated by reference herein in its entirety. Typically 
the microplate is 20 mm by 30 mm as compared to a standard 96 well microtiter plate 
which is 86 mm by 129 mm. The higher density array of cells on a microplate allows 
the microplate to be imaged at a low resolution of a few microns per pixel for high 
throughput and particular locations on the microplate to be imaged at a higher 
resolution of less than 0.5 microns per pixel. These two resolution modes help to 
improve the overall throughput of the system. 

The microplate chamber 42 serves as a microfluidic delivery system for the 
addition of compounds to cells. The microplate 41 in the microplate chamber 42 is 
placed in an XY microplate reader 43. Digital data is processed as described above. 
The small size of this microplate system increases throughput, minimizes reagent 
volume and allows control of the distribution and placement of cells for fast and precise 
acquires and analyzes high resolution image data collected from individual cells within 
a well. 

The HTS software, residing on the system's computer 62, controls the high 
throughput instrument, and results are displayed on the monitor 61. The HCS software, 
residing on its computer system 67, controls the high content instrument hardware 65, 
optional devices (e.g. plate loader, environmental chamber, fluid dispenser), analyzes 
digital image data from the plate, displays results on the monitor 66 and manages data 
measured in an integrated database. The two systems can also share a single computer, 
in which case all data would be collected, processed and displayed on that computer, 
without the need for a local area network to transfer the data. Microtiter plates are 
transferred from the high throughput system to the high content system 63 either 
manually or by a robotic plate transfer device, as is well known in the art (Beggs 
(1997), supra; Mcaffrey (1996), supra). 

In a preferred embodiment, the dual mode optical system utilizes a single 
platform system (Figure 7). It consists of two separate optical modules, an HCS 
module 203 and an HTS module 209 that can be independently or collectively moved 
so that only one at a time is used to collect data from the microtiter plate 201. The 
microtiter plate 201 is mounted in a motorized X,Y stage so it can be positioned for 
imaging in either HTS or HCS mode. After collecting and analyzing the HTS image 
data as described below, the HTS optical module 209 is moved out of the optical path 
and the HCS optical module 203 is moved into place. 
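
By way of non-limiting illustration, the sequence described above (whole-plate HTS read, followed by HCS on selected wells of the same plate) can be sketched as follows. This is a minimal sketch under stated assumptions, not the control software of the system: the function names `select_hits` and `dual_mode_screen`, the use of a simple signal threshold to define a hit, and the `hcs_measure` callback are all hypothetical and are introduced only for illustration.

```python
from typing import Callable, Dict, List


def select_hits(well_signals: Dict[str, float], threshold: float) -> List[str]:
    """Return wells whose whole-well HTS signal meets an assumed hit threshold."""
    return sorted(well for well, signal in well_signals.items() if signal >= threshold)


def dual_mode_screen(
    well_signals: Dict[str, float],
    threshold: float,
    hcs_measure: Callable[[str], dict],
) -> Dict[str, dict]:
    """Run a (hypothetical) cell-by-cell HCS measurement only on HTS hit wells."""
    hits = select_hits(well_signals, threshold)
    # In the single-platform embodiment, this is the point at which the HTS
    # module would be moved out of the optical path and the HCS module moved
    # into place before the hit wells are revisited.
    return {well: hcs_measure(well) for well in hits}
```

Restricting the second pass to hit wells is what keeps the slower high content measurement from limiting overall plate throughput.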

The optical module for HTS 209 consists of a projection lens 214, excitation 
wavelength filter 213 and dichroic mirror 210 which are used to illuminate the whole 
bottom of the plate with a specific wavelength band from a conventional microscope 
lamp system (not illustrated). The fluorescence emission is collected through the 
dichroic mirror 210 and emission wavelength filter 211 by a lens 212 which forms an 
image on the camera 216 with sensor 215. 

The optical module for HCS 203 consists of a projection lens 208, excitation 
wavelength filter 207 and dichroic mirror 204 which are used to illuminate the back 
aperture of the microscope objective 202, and thereby the field of that objective, from a 
standard microscope illumination system (not shown). The fluorescence emission is 
microns. Methods for making microplates are described in U.S. Patent Application 
Serial No. 08/865,341, incorporated by reference herein in its entirety. Microplates 
may consist of coplanar layers of materials to which cells adhere, patterned with 
materials to which cells will not adhere, or etched 3-dimensional surfaces of similarly 
patterned materials. For the purpose of the following discussion, the terms 'well' and 
'microwell' refer to a location in an array of any construction to which cells adhere and 
within which the cells are imaged. Microplates may also include fluid delivery 
channels in the spaces between the wells. The smaller format of a microplate increases 
the overall efficiency of the system by minimizing the quantities of the reagents, 
storage and handling during preparation and the overall movement required for the 
scanning operation. In addition, the whole area of the microplate can be imaged more 
efficiently, allowing a second mode of operation for the microplate reader as described 
later in this document. 

Fluorescence Reporter Molecules 

A major component of the new drug discovery paradigm is a continually 
growing family of fluorescent and luminescent reagents that are used to measure the 
temporal and spatial distribution, content, and activity of intracellular ions, metabolites, 
macromolecules, and organelles. Classes of these reagents include labeling reagents 
that measure the distribution and amount of molecules in living and fixed cells, 
environmental indicators to report signal transduction events in time and space, and 
fluorescent protein biosensors to measure target molecular activities within living cells. 
A multiparameter approach that combines several reagents in a single cell is a powerful 
new tool for drug discovery. 

The method of the present invention is based on the high affinity of fluorescent 
or luminescent molecules for specific cellular components. The affinity for specific 
components is governed by physical forces such as ionic interactions, covalent bonding 
(which includes chimeric fusion with protein-based chromophores, fluorophores, and 
lumiphores), as well as hydrophobic interactions, electrical potential, and, in some 
cases, simple entrapment within a cellular component. The luminescent probes can be 
small molecules, labeled macromolecules, or genetically engineered proteins, 
including, but not limited to green fluorescent protein chimeras. 
Giuliano et al. (1987), Anal Biochem. 167:362-371; Thomas et al. (1979), 
Biochemistry 18:2210-2218). It can be determined whether a reporter having a 
chelating group is bound to an ion, such as Ca++, or not (Bright et al. (1989), In 
Methods in Cell Biology, Vol. 30, Taylor and Wang (eds.), pp. 157-192; Shimoura et al. 
(1988), J. of Biochemistry (Tokyo) 251:405-410; Tsien (1989) In Methods in Cell 
Biology, Vol. 30, Taylor and Wang (eds.), pp. 127-156). 

Furthermore, certain cell types within an organism may contain components 
that can be specifically labeled that may not occur in other cell types. For example, 
epithelial cells often contain polarized membrane components. That is, these cells 
asymmetrically distribute macromolecules along their plasma membrane. Connective 
or supporting tissue cells often contain granules in which are trapped molecules specific 
to that cell type (e.g., heparin, histamine, serotonin, etc.). Most muscular tissue cells 
contain a sarcoplasmic reticulum, a specialized organelle whose function is to regulate 
the concentration of calcium ions within the cell cytoplasm. Many nervous tissue cells 
contain secretory granules and vesicles in which are trapped neurohormones or 
neurotransmitters. Therefore, fluorescent molecules can be designed to label not only 
specific components within specific cells, but also specific cells within a population of 
mixed cell types. 

Those skilled in the art will recognize a wide variety of ways to measure 
fluorescence. For example, some fluorescent reporter molecules exhibit a change in 
excitation or emission spectra, some exhibit resonance energy transfer where one 
fluorescent reporter loses fluorescence, while a second gains in fluorescence, some 
exhibit a loss (quenching) or appearance of fluorescence, while some report rotational 
movements (Giuliano et al. (1995), Ann. Rev. of Biophysics and Biomol. Structure 
24:405-434; Giuliano et al. (1995), Methods in Neuroscience 27:1-16). 

Scanning cell arrays 

Referring to Figure 9, a preferred embodiment is provided to analyze cells that 
comprises operator-directed parameters being selected based on the assay being 
conducted, data acquisition by the cell screening system on the distribution of 
fluorescent signals within a sample, and interactive data review and analysis. At the 
start of an automated scan the operator enters information 100 that describes the 
sample, specifies the filter settings and fluorescent channels to match the biological 
plane focal model. Starting a programmable distance above or below this set point, the 
procedure moves the mechanical Z-axis through a number of different positions, 
acquires an image at each position, and finds the maximum of a calculated focus score 
that estimates the contrast of each image. The Z position of the image with the 
maximum focus score determines the best focus for a particular field. Those skilled in 
the art will recognize this as a variant of automatic focusing methods as described in 
Harms et al. in Cytometry 5 (1984), 236-243, Groen et al. in Cytometry 6 (1985), 81-91, 
and Firestone et al. in Cytometry 12 (1991), 195-206. 
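
By way of non-limiting illustration, the focus search described above can be sketched as follows. This is a minimal sketch, not the focusing procedure of the cited references: the contrast estimate used here (sum of squared differences between horizontally adjacent pixels) is an assumption chosen for simplicity, and `focus_score`, `autofocus`, and the `simulated_acquire` stand-in for the camera and Z-axis drive are hypothetical names.

```python
from typing import Callable, List, Tuple

Image = List[List[float]]


def focus_score(image: Image) -> float:
    """Assumed contrast estimate: sum of squared horizontal neighbor differences."""
    return sum(
        (row[x + 1] - row[x]) ** 2
        for row in image
        for x in range(len(row) - 1)
    )


def autofocus(
    acquire: Callable[[float], Image],
    z_start: float,
    z_step: float,
    n_steps: int,
) -> Tuple[float, float]:
    """Step Z through n_steps positions and return (best_z, best_score)."""
    best_z, best_score = z_start, float("-inf")
    for i in range(n_steps):
        z = z_start + i * z_step
        score = focus_score(acquire(z))  # one image per Z position
        if score > best_score:
            best_z, best_score = z, score
    return best_z, best_score


def simulated_acquire(z: float) -> Image:
    """Hypothetical stand-in for the camera: contrast peaks at z = 3.0."""
    amp = 1.0 / (1.0 + abs(z - 3.0))
    return [[0.0, amp, 0.0, amp], [amp, 0.0, amp, 0.0]]
```

With the simulated stage, `autofocus(simulated_acquire, 0.0, 1.0, 7)` selects the Z position at which the image contrast is greatest.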

For image acquisition, the camera's exposure time is separately adjusted for 
each dye to ensure a high-quality image from each channel. Software procedures can be 
called, at the user's option, to correct for registration shifts between wavelengths by 
accounting for linear (X and Y) shifts between wavelengths before making any further 
measurements. The electronic shutter 18 is controlled so that sample photo-bleaching is 
kept to a minimum. Background shading and uneven illumination can be corrected by 
the software using methods known in the art (Bright et al. (1987), J. Cell Biol. 
104:1019-1033). 
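
By way of non-limiting illustration, one way to estimate a linear (X and Y) registration shift between two wavelength channels is sketched below. This is a minimal sketch under stated assumptions and is not asserted to be the software procedure of the system: it assumes integer pixel shifts, searches a small window exhaustively, and scores each candidate by an unnormalized sum of products, which can favor bright regions on real images. The name `best_shift` and its parameters are hypothetical.

```python
from typing import List, Tuple

Image = List[List[float]]


def best_shift(ref: Image, img: Image, max_shift: int = 2) -> Tuple[int, int]:
    """Return (dx, dy) such that img[y + dy][x + dx] best matches ref[y][x]."""
    height, width = len(ref), len(ref[0])
    best = (0, 0)
    best_score = float("-inf")
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            score = 0.0
            for y in range(height):
                yy = y + dy
                if not 0 <= yy < height:
                    continue  # outside the overlap region
                for x in range(width):
                    xx = x + dx
                    if 0 <= xx < width:
                        score += ref[y][x] * img[yy][xx]
            if score > best_score:
                best, best_score = (dx, dy), score
    return best
```

The returned offsets would then be applied to the second channel before any per-cell measurement is made, so that masks derived from one channel overlay the same cells in the other.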

In one channel, images are acquired of a primary marker 105 (Figure 9) 
(typically cell nuclei counterstained with DAPI or PI fluorescent dyes) which are 
segmented ("identified") using an adaptive thresholding procedure. The adaptive 
thresholding procedure 106 is used to dynamically select the threshold of an image for 
separating cells from the background. The staining of cells with fluorescent dyes can 
vary to an unknown degree across cells in a microtiter plate sample as well as within 
images of a field of cells within each well of a microtiter plate. This variation can occur 
as a result of sample preparation and/or the dynamic nature of cells. A global threshold 
is calculated for the complete image to separate the cells from background and account 
for field to field variation. These global adaptive techniques are variants of those 
described in the art. (Kittler et al. in Computer Vision, Graphics, and Image 
Processing 30 (1985), 125-147, Ridler et al. in IEEE Trans. Systems, Man, and 
Cybernetics (1978), 630-632.) 
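
By way of non-limiting illustration, a global threshold can be selected per image with an iterative scheme in the style of Ridler et al. The following is a minimal sketch of that general technique, not the specific variant used by the system: the name `iterative_threshold`, the convergence tolerance `tol`, and the representation of the image as a flat list of pixel intensities are assumptions.

```python
from typing import List, Sequence


def iterative_threshold(pixels: Sequence[float], tol: float = 0.5) -> float:
    """Iteratively place the threshold midway between the two class means."""
    t = sum(pixels) / len(pixels)  # start from the global mean intensity
    while True:
        below = [p for p in pixels if p <= t]
        above = [p for p in pixels if p > t]
        if not below or not above:
            return t  # uniform image: nothing to separate
        new_t = 0.5 * (sum(below) / len(below) + sum(above) / len(above))
        if abs(new_t - t) < tol:
            return new_t
        t = new_t


def segment(pixels: Sequence[float]) -> List[bool]:
    """Mark pixels brighter than the selected threshold as object (e.g. nuclei)."""
    t = iterative_threshold(pixels)
    return [p > t for p in pixels]
```

Because the threshold is recomputed from each image's own intensities, the same procedure adapts to field-to-field differences in staining without a fixed cutoff.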

An alternative adaptive thresholding method utilizes local region thresholding 
in contrast to global image thresholding. Image analysis of local regions leads to better 
overall segmentation since staining of cell nuclei (as well as other labeled components) 
6. The area of the cytoplasmic mask 

7. The average fluorescent intensity of the cytoplasmic mask for colors 2-4 
(i.e. #5 divided by #6) 

8. The ratio of the average fluorescent intensity of the cytoplasmic mask to 
average fluorescent intensity within the cell nucleus for colors 2-4 (i.e. #7 
divided by #4) 

9. The difference of the average fluorescent intensity of the cytoplasmic mask 
and the average fluorescent intensity within the cell nucleus for colors 2-4 
(i.e. #7 minus #4) 

10. The number of fluorescent domains (also called spots, dots, or grains) within 
the cell nucleus for colors 2-4 

Features 1 through 4 are general features of the different cell screening assays 
of the invention. These steps are commonly used in a variety of image analysis 
applications and are well known in the art (Russ (1992) The Image Processing Handbook, 
CRC Press Inc.; Gonzales et al. (1987), Digital Image Processing, Addison-Wesley 
Publishing Co., pp. 391-448). Features 5-9 have been developed specifically to provide 
measurements of a cell's fluorescent molecules within the local cytoplasmic region of 
the cell and the translocation (i.e. movement) of fluorescent molecules from the 
cytoplasm to the nucleus. These features (steps 5-9) are used for analyzing cells in 
microplates for the inhibition of nuclear translocation. For example, inhibition of 
nuclear translocation of transcription factors provides a novel approach to screening 
intact cells (detailed examples of other types of screens will be provided below). A 
specific method measures the amount of probe in the nuclear region (feature 4) versus 
the local cytoplasmic region (feature 7) of each cell. Quantification of the difference 
between these two sub-cellular compartments provides a measure of cytoplasm-nuclear 
translocation (feature 9). 
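
The arithmetic relating features 7 through 9 to features 4 through 6 can be sketched as follows. This is an illustrative sketch only: the class and function names are hypothetical, and feature 5 is assumed to be the total (integrated) fluorescent intensity of the cytoplasmic mask, which is what the expression "#5 divided by #6" implies.

```python
from dataclasses import dataclass


@dataclass
class CellMeasurements:
    nuclear_avg_intensity: float  # feature 4: average intensity within the nucleus
    cyto_total_intensity: float   # feature 5 (assumed): total intensity of the cytoplasmic mask
    cyto_area: float              # feature 6: area of the cytoplasmic mask, in pixels


def translocation_features(m):
    """Derive features 7-9 for one cell and one color channel."""
    if m.cyto_area <= 0 or m.nuclear_avg_intensity <= 0:
        raise ValueError("area and nuclear intensity must be positive")
    cyto_avg = m.cyto_total_intensity / m.cyto_area    # feature 7 = #5 / #6
    ratio = cyto_avg / m.nuclear_avg_intensity         # feature 8 = #7 / #4
    difference = cyto_avg - m.nuclear_avg_intensity    # feature 9 = #7 - #4
    return {"cyto_avg": cyto_avg, "ratio": ratio, "difference": difference}


# Example cell whose probe is concentrated in the nucleus.
features = translocation_features(CellMeasurements(50.0, 3000.0, 120.0))
```

In this example the cytoplasmic average (25.0) is half the nuclear average, so the ratio is 0.5 and the difference is -25.0, the pattern expected when the probe has moved into the nucleus.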

Feature 10 describes a screen used for counting of DNA or RNA probes within 
the nuclear region in colors 2-4. For example, probes are commercially available for 
identifying chromosome-specific DNA sequences (Life Technologies, Gaithersburg, 
MD; Genosys, Woodlands, TX; Biotechnologies, Inc., Richmond, CA; Bio 101, Inc., 
Vista, CA). Cells are three-dimensional in nature, and when examined at a high 
magnification under a microscope one probe may be in focus while another may be 
completely out of focus. The cell screening method of the present invention provides 
for detecting three-dimensional probes in nuclei by acquiring images from multiple 
focal planes. The software moves the Z-axis motor drive 5 (Figure 1) in small steps 
procedure ljj). Hard copies of graphs and images can be printed on a wide range of 
standard printers. 

As a final phase of a complete scan, reports can be generated on one or more 
statistics of the measured features. Users can generate a graphical report of data 
summarized on a well-by-well basis for the scanned region of the plate using an 
interactive report generation procedure 120. This report includes a summary of the 
statistics by well in tabular and graphical format and identification information on the 
sample. The report window allows the operator to enter comments about the scan for 
later retrieval. Multiple reports can be generated on many statistics and be printed with 
the touch of one button. Reports can be previewed for placement and data before being 
printed. 

The above-recited embodiment of the method operates in a single high 
resolution mode referred to as the high content screening (HCS) mode. The HCS mode 
provides sufficient spatial resolution within a well (on the order of 1 μm) to define the 
distribution of material within the well, as well as within individual cells in the well. 
The high degree of information content accessible in that mode comes at the expense 
of speed and complexity of the required signal processing. 

In an alternative embodiment, a high throughput system (HTS) is directly 
coupled with the HCS either on the same platform or on two separate platforms 
connected electronically (e.g. via a local area network). This embodiment of the 
invention, referred to as a dual mode optical system, has the advantage of increasing the 
throughput of an HCS by coupling it with an HTS and thereby requiring slower high 
resolution data acquisition and analysis only on the small subset of wells that show a 
response in the coupled HTS. 

High throughput 'whole plate' reader systems are well known in the art and are 
commonly used as a component of an HTS system used to screen large numbers of 
compounds (Beggs et al. (1997), supra; McCaffrey et al. (1996), supra). The HTS of 
the present invention is carried out on the microtiter plate or microwell array by reading 
many or all wells in the plate simultaneously with sufficient resolution to make 
determinations on a well-by-well basis. That is, calculations are made by averaging the 
total signal output of many or all the cells or the bulk of the material in each well. 
more plates to be analyzed 3U the system loads the next plate 303; otherwise the 
analysis of the plates terminates 314. 

The following discussion describes the high throughput mode illustrated in 
Figure 12. The preferred embodiment of the system, the single platform dual mode 
screening system, will be described. Those skilled in the art will recognize that 
operationally the dual platform system simply involves moving the plate between two 
optical systems rather than moving the optics. Once the system has been set up and the 
plate loaded, the system begins the HTS acquisition and analysis 401. The HTS optical 
module is selected by controlling a motorized optical positioning device 402 on the 
dual mode system. In one fluorescence channel, data from a primary marker on the 
plate is acquired 403 and wells are isolated from the plate background using a masking 
procedure 404. Images are also acquired in other fluorescence channels being used 405. 
The region in each image corresponding to each well 406 is measured 407. A feature 
calculated from the measurements for a particular well is compared with a predefined 
threshold or intensity response 408, and based on the result the well is either flagged as 
a "hit" 409 or not. The locations of the wells flagged as hits are recorded for 
subsequent high content mode processing. If there are wells remaining to be processed 
410, the program loops back 406 until all the wells have been processed 411 and the 
system exits high throughput mode. 
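
The per-well loop of the high throughput mode (steps 406 through 409) can be sketched as below. This is a hedged illustration: the function names, the rectangular well regions, the use of mean intensity as the measured feature, and the "greater than or equal to" comparison are all assumptions made for the example, since the text only requires that a feature be compared with a predefined threshold or intensity response.

```python
def measure_wells(plate_image, well_regions):
    """Mean intensity of each well's region in a whole-plate image.

    plate_image: list of rows of pixel intensities.
    well_regions: {well label: (row_start, row_stop, col_start, col_stop)},
                  a hypothetical stand-in for the well masking procedure.
    """
    features = {}
    for well, (r0, r1, c0, c1) in well_regions.items():
        pixels = [plate_image[r][c] for r in range(r0, r1) for c in range(c0, c1)]
        features[well] = sum(pixels) / len(pixels)
    return features


def flag_hits(well_features, threshold):
    """Return the wells whose feature meets the threshold, in plate order."""
    return [well for well, value in well_features.items() if value >= threshold]


# Toy plate image covering two wells; only well A1 responds.
plate = [[0] * 4 for _ in range(4)]
for r in range(2):
    for c in range(2):
        plate[r][c] = 100
regions = {"A1": (0, 2, 0, 2), "A2": (0, 2, 2, 4)}
well_features = measure_wells(plate, regions)
hits = flag_hits(well_features, threshold=50)
```

The list of flagged wells is the only information that needs to be handed to the high content mode, which then revisits just those stage locations.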

Following HTS analysis, the system starts the high content mode processing 
501 defined in Figure 13. The system selects the HCS optical module 502 by 
controlling the motorized positioning system. For each "hit" well identified in high 
throughput mode, the XY stage location of the well is retrieved from memory or disk 
and the stage is then moved to the selected stage location 503. The autofocus procedure 
504 is called for the first field in each hit well and then once every 5 to 8 fields within 
each well. In one channel, images are acquired of the primary marker 505 (typically 
cell nuclei counterstained with DAPI, Hoechst or PI fluorescent dye). The images are 
then segmented (separated into regions of nuclei and non-nuclei) using an adaptive 
thresholding procedure 506. The output of the segmentation procedure is a binary mask 
wherein the objects are white and the background is black. This binary image, also 
called a mask in the art, is used to determine if the field contains objects 507. The mask 
The kinetic live cell extension of the invention enables the design and use of 
screens in which a biological process is characterized by its kinetics instead of, or in 
addition to, its spatial characteristics. In many cases, a response in live cells can be 
measured by adding a reagent to a specific well and making multiple measurements on 
that well with the appropriate timing. This dynamic live cell embodiment of the 
invention therefore includes apparatus for fluid delivery to individual wells of the 
system in order to deliver reagents to each well at a specific time in advance of reading 
the well. This embodiment thereby allows kinetic measurements to be made with 
temporal resolution of seconds to minutes on each well of the plate. To improve the 
overall efficiency of the dynamic live cell system, the acquisition control program is 
modified to allow repetitive data collection from sub-regions of the plate, allowing the 
system to read other wells between the time points required for an individual well. 

Figure 8 describes an example of a fluid delivery device for use with the live 
cell embodiment of the invention and is described above. This set-up allows one set of 
pipette tips 705, or even a single pipette tip, to deliver reagent to all the wells on the 
plate. The bank of syringe pumps 701 can be used to deliver fluid to 12 wells 
simultaneously, or to fewer wells by removing some of the tips 705. The temporal 
resolution of the system can therefore be adjusted, without sacrificing data collection 
efficiency, by changing the number of tips and the scan pattern as follows. Typically, 
the data collection and analysis from a single well takes about 5 seconds. Moving from 
well to well and focusing in a well requires about 5 seconds, so the overall cycle time 
for a well is about 10 seconds. Therefore, if a single pipette tip is used to deliver fluid 
to a single well, and data is collected repetitively from that well, measurements can be 
made with about 5 seconds temporal resolution. If 6 pipette tips are used to deliver 
fluids to 6 wells simultaneously, and the system repetitively scans all 6 wells, each scan 
will require 60 seconds, thereby establishing the temporal resolution. For slower 
processes which only require data collection every 8 minutes, fluids can be delivered to 
one half of the plate, by moving the plate during the fluid delivery phase, and then 
repetitively scanning that half of the plate. Therefore, by adjusting the size of the 
sub-region being scanned on the plate, the temporal resolution can be adjusted without 
having to insert wait times between acquisitions. Because the system is continuously 
scanning and acquiring data, the overall time to collect a kinetic data set from the plate 
kinetic analysis mode comprises operator identification of sub-regions of the microtiter 
plate or microwells to be screened, based on the kinetic response to be investigated, 
with data acquisitions within a sub-region prior to data acquisition in subsequent 
sub-regions. 
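
The cycle-time arithmetic given above for the live cell embodiment (about 5 seconds to acquire and analyze a well, plus about 5 seconds to move and focus) can be worked through with a short sketch. The function names are hypothetical, and the 48-well case assumes that "one half of the plate" refers to a 96-well plate.

```python
def scan_period_seconds(n_wells, acquire_s=5.0, move_focus_s=5.0):
    """Time between successive reads of the same well when n_wells are scanned in a loop."""
    if n_wells < 1:
        raise ValueError("at least one well is required")
    if n_wells == 1:
        return acquire_s  # the stage stays on one well, so no move or refocus
    return n_wells * (acquire_s + move_focus_s)


def wells_for_period(target_s, acquire_s=5.0, move_focus_s=5.0):
    """Largest sub-region (in wells) that can be scanned within target_s per pass."""
    return max(1, int(target_s // (acquire_s + move_focus_s)))


single_well = scan_period_seconds(1)   # one pipette tip, one well
six_wells = scan_period_seconds(6)     # six tips, six wells
half_plate = scan_period_seconds(48)   # half of an assumed 96-well plate
```

These reproduce the figures in the text: 5 seconds for a single well, 60 seconds for six wells, and 480 seconds (8 minutes) for half of a 96-well plate. Running the calculation in reverse with `wells_for_period` gives the sub-region size to choose for a required temporal resolution, so that no wait times are needed between acquisitions.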

Specific Screens 

In another aspect of the present invention, cell screening methods and a machine 
readable storage medium comprising a program containing a set of instructions for 
causing a cell screening system to execute procedures for defining the distribution and 
activity of specific cellular constituents and processes are provided. In a preferred 
embodiment, the cell screening system comprises a high magnification fluorescence 
optical system with a stage adapted for holding cells and a means for moving the stage, 
a digital camera, a light source, and a computer means for receiving and processing the 
digital data from the digital camera. This aspect of the invention comprises programs that instruct the 
cell screening system to define the distribution and activity of specific cellular 
constituents and processes, using the luminescent probes, the optical imaging system, 
and the pattern recognition software of the invention. Preferred embodiments of the 
machine readable storage medium comprise programs consisting of a set of instructions 
for causing a cell screening system to execute the procedures set forth in Figures 9, 11, 
12, 13, 14 or 15. Another preferred embodiment comprises a program consisting of a 
set of instructions for causing a cell screening system to execute procedures for 
detecting the distribution and activity of specific cellular constituents and processes. In 
most preferred embodiments, the cellular processes include, but are not limited to, 
nuclear translocation of a protein, cellular morphology, apoptosis, receptor 
internalization, and protease-induced translocation of a protein. 

In a preferred embodiment, the cell screening methods are used to identify 
compounds that modify the various cellular processes. The cells can be contacted with 
a test compound, and the effect of the test compound on a particular cellular process 
can be analyzed. Alternatively, the cells can be contacted with a test compound and a 
known agent that modifies the particular cellular process, to determine whether the test 
compound can inhibit or enhance the effect of the known agent. Thus, the methods can 
user defined parameters and valid nuclear masks are identified and used with the 
following method to extract transcription factor distributions. Each valid nuclear mask 
is eroded to define a slightly smaller nuclear region. The original nuclear mask is then 
dilated in two steps to define a ring shaped region around the nucleus, which represents 
a cytoplasmic region. The average antibody fluorescence in each of these two regions 
is determined, and the difference between these averages is defined as the NucCyt 
Difference. Two examples of determining nuclear translocation are discussed below 
and illustrated in Figure 10A-J. Figure 10A illustrates an unstimulated cell with its 
nucleus 200 labeled with a blue fluorophore and a transcription factor in the cytoplasm 
201 labeled with a green fluorophore. Figure 10B illustrates the nuclear mask 202 
derived by the cell-based screening system. Figure 10C illustrates the cytoplasm 203 
of the unstimulated cell imaged at a green wavelength. Figure 10D illustrates that the 
nuclear mask 202 is eroded (reduced) once to define a nuclear sampling region 204 
with minimal cytoplasmic distribution. The nucleus boundary 202 is dilated (expanded) 
several times to form a ring, 2-3 pixels wide, that is used to define the 
cytoplasmic sampling region 205 for the same cell. Figure 10E further illustrates a side 
view which shows the nuclear sampling region 204 and the cytoplasmic sampling 
region 205. Using these two sampling regions, data on nuclear translocation can be 
automatically analyzed by the cell-based screening system on a cell by cell basis. 

Figure 10F-J illustrates the strategy for determining nuclear translocation in a 
stimulated cell. Figure 10F illustrates a stimulated cell with its nucleus 206 labeled with 
a blue fluorophore and a transcription factor in the cytoplasm 207 labeled with a green 
fluorophore. The nuclear mask 208 in Figure 10G is derived by the cell based 
screening system. Figure 10H illustrates the cytoplasm 209 of a stimulated cell imaged 
at a green wavelength. Figure 10I illustrates the nuclear sampling region 211 and 
cytoplasmic sampling region 212 of the stimulated cell. Figure 10J further illustrates a 
side view which shows the nuclear sampling region 211 and the cytoplasmic sampling 
region 212. 
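
The erosion and dilation strategy illustrated in Figure 10 can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the system's implementation: the helper names, the 4-connected neighborhood, the one-pixel gap between nucleus and ring, and the sign convention (nuclear average minus cytoplasmic average) are all choices made for the example.

```python
NEIGHBORS = ((0, 0), (-1, 0), (1, 0), (0, -1), (0, 1))  # 4-connected, assumed


def _on(mask, r, c):
    """True if (r, c) is inside the image and set in the mask."""
    return 0 <= r < len(mask) and 0 <= c < len(mask[0]) and bool(mask[r][c])


def erode(mask):
    """Keep a pixel only if it and all of its neighbors are set."""
    return [[all(_on(mask, r + dr, c + dc) for dr, dc in NEIGHBORS)
             for c in range(len(mask[0]))] for r in range(len(mask))]


def dilate(mask, times=1):
    """Set a pixel if it or any neighbor is set; repeat `times` times."""
    for _ in range(times):
        mask = [[any(_on(mask, r + dr, c + dc) for dr, dc in NEIGHBORS)
                 for c in range(len(mask[0]))] for r in range(len(mask))]
    return mask


def nuc_cyt_difference(nuclear_mask, antibody_image, gap=1, ring_width=2):
    """Average intensity in the eroded nucleus minus average in the surrounding ring."""
    nucleus = erode(nuclear_mask)        # nuclear sampling region
    inner = dilate(nuclear_mask, gap)    # first dilation step
    outer = dilate(inner, ring_width)    # second dilation step
    nuc, cyt = [], []
    for r, row in enumerate(antibody_image):
        for c, value in enumerate(row):
            if nucleus[r][c]:
                nuc.append(value)
            elif outer[r][c] and not inner[r][c]:
                cyt.append(value)        # ring = cytoplasmic sampling region
    if not nuc or not cyt:
        raise ValueError("sampling region is empty")
    return sum(nuc) / len(nuc) - sum(cyt) / len(cyt)


# Synthetic stimulated cell: label concentrated in a 5x5 nucleus.
mask = [[5 <= r < 10 and 5 <= c < 10 for c in range(15)] for r in range(15)]
image = [[100.0 if mask[r][c] else 20.0 for c in range(15)] for r in range(15)]
difference = nuc_cyt_difference(mask, image)
```

For this synthetic cell the nuclear average is 100 and the ring average is 20, giving a difference of 80. A cell whose label remains in the cytoplasm would give a value near zero or negative under this sign convention.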

A specific application of this method has been used to validate it as a 
screen. A human cell line was plated in 96 well microtiter plates. Some rows of wells 
were titrated with IL-1, a known inducer of the NF-κB transcription factor. The cells 
were then fixed and stained by standard methods with a fluorescein labeled antibody to 
to the nucleus upon activation. In another specific example, activation of the c-fos 
transcription factor was assessed by defining its spatial position within cells. Activated 
c-fos is found only within the nucleus, while inactivated c-fos resides within the 
cytoplasm. 

3T3 cells were plated at 5000-10000 cells per well in a Polyfiltronics 96-well 
plate. The cells were allowed to attach and grow overnight. The cells were rinsed 
twice with 100 μl serum-free medium, incubated for 24-30 hours in serum-free MEM 
culture medium, and then stimulated with platelet derived growth factor (PDGF-BB) 
(Sigma Chemical Co., St. Louis, MO) diluted directly into serum-free medium at 
concentrations ranging from 1-50 ng/ml for an average time of 20 minutes. 

Following stimulation, cells were fixed for 20 minutes in 3.7% formaldehyde 
solution in 1X Hanks buffered saline solution (HBSS). After fixation, the cells were 
washed with HBSS to remove residual fixative, permeabilized for 90 seconds with 
0.5% Triton X-100 solution in HBSS, and washed twice with HBSS to remove residual 
detergent. The cells were then blocked for 15 minutes with a 0.1% solution of BSA in 
HBSS, and further washed with HBSS prior to addition of diluted primary antibody 
solution. 

c-Fos rabbit polyclonal antibody (Calbiochem, PC05) was diluted 1:50 in 
HBSS, and 50 μl of the dilution was applied to each well. Cells were incubated in the 
presence of primary antibody for one hour at room temperature, and then incubated for 
one hour at room temperature in a light tight container with goat anti-rabbit secondary 
antibody conjugated to ALEXA™ 488 (Molecular Probes), diluted 1:500 from a 100 
μg/ml stock in HBSS. Hoechst DNA dye (Molecular Probes) was then added at a 
1:1000 dilution of the manufacturer's stock solution (10 mg/ml). The cells were then 
washed with HBSS, and the plate was sealed prior to analysis with the cell screening 
system of the invention. The data from these experiments demonstrated that the 
methods of the invention could be used to measure transcriptional activation of c-fos by 
defining its spatial position within cells. 

One of skill in the art will recognize that while the following method is applied to 
detection of c-fos activation, it can be applied to the analysis of any transcription factor 
that translocates from the cytoplasm to the nucleus upon activation. Examples of such 
transcription factors include, but are not limited to, fos and jun homologs, NF-κB 
from the cytoplasm to the nucleus upon activation, and instructions for using the 
expression vector to identify compounds that modify transcription factor activation in a 
cell of interest. Alternatively, the kits contain a purified, luminescently labeled 
transcription factor. In a preferred embodiment, the transcription factor is expressed as 
a fusion protein with a luminescent protein, including but not limited to green 
fluorescent protein, luciferase, or mutants or fragments thereof. In various preferred 
embodiments, the kit further contains cells that are transfected with the expression 
vector, an antibody or fragment that specifically binds to the transcription factor of 
interest, and/or a compound that is known to modify activation of the transcription 
factor of interest (as above). 

b. Protein Kinases 

The cytoplasm to nucleus screening methods can also be used to analyze the 
activation of any protein kinase that is present in an inactive state in the cytoplasm and 
is transported to the nucleus upon activation, or that phosphorylates a substrate that 
translocates from the cytoplasm to the nucleus upon phosphorylation. Examples of 
appropriate protein kinases include, but are not limited to extracellular signal-regulated 
protein kinases (ERKs), c-Jun amino-terminal kinases (JNKs), Fos regulating protein 
kinases (FRKs), p38 mitogen activated protein kinase (p38MAPK), protein kinase A 
(PKA), and mitogen activated protein kinase kinases (MAPKKs). (For example, see 
Hall, et al. 1999. J Biol Chem. 274:376-83; Han, et al. 1995. Biochim. Biophys. Acta. 
1265:224-227; Jaaro et al. 1997. Proc. Natl. Acad. Sci. U.S.A. 94:3742-3747; Taylor, et 
al. 1994. J. Biol. Chem. 269:308-318; Zhao, Q., and F. S. Lee. 1999. J Biol Chem. 
274:8355-8; Paolillo et al. 1999. J Biol Chem. 274:6546-52; Coso et al. 1995. Cell 
81:1137-1146; Tibbles, L.A., and J.R. Woodgett. 1999. Cell Mol Life Sci. 55:1230-54; 
Schaeffer, H. J., and M. J. Weber. 1999. Mol Cell Biol. 19:2435-44.) 

Alternatively, protein kinase activity is assayed by monitoring translocation of a 
luminescently labeled protein kinase substrate from the cytoplasm to the nucleus after 
being phosphorylated by the protein kinase of interest. In this embodiment, the 
substrate is non-phosphorylated and cytoplasmic prior to phosphorylation, and is 
translocated to the nucleus upon phosphorylation by the protein kinase. There is no 
requirement that the protein kinase itself translocates from the cytoplasm to the nucleus 
In another aspect, kits are provided for analyzing protein kinase activation, 
comprising a primary antibody that specifically binds to a protein kinase, a protein 
kinase substrate, or a phosphorylated form of the protein kinase substrate of interest and 
instructions for using the primary antibody to identify compounds that modify protein 
kinase activation in a cell of interest. In a preferred embodiment, the primary antibody, 
or a secondary antibody that detects the primary antibody, is luminescently labeled. In 
other preferred embodiments, the kit further comprises cells that express the protein 
kinase of interest, and/or a compound that is known to modify activation of the protein 
kinase of interest, including but not limited to dibutyryl cAMP (modifies PKA), 
forskolin (PKA), and anisomycin (p38MAPK). 

Alternatively, the kits comprise an expression vector encoding a protein kinase 
or a protein kinase substrate of interest that translocates from the cytoplasm to the 
nucleus upon activation and instructions for using the expression vector to identify 
compounds that modify protein kinase activation in a cell of interest. Alternatively, the 
kits contain a purified, luminescently labeled protein kinase or protein kinase substrate. 
In a preferred embodiment, the protein kinase or protein kinase substrate of interest is 
expressed as a fusion protein with a luminescent protein. In further preferred 
embodiments, the kit further comprises cells that are transfected with the expression 
vector, an antibody or fragment thereof that specifically binds to the protein kinase or 
protein kinase substrate of interest, and/or a compound that is known to modify 
activation of the protein kinase of interest (as above). 

In another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instructions for causing a cell 
screening system to execute the methods disclosed for analyzing transcription factor or 
protein kinase activation, wherein the cell screening system comprises an optical 
system with a stage adapted for holding a plate containing cells, a digital camera, a 
means for directing fluorescence or luminescence emitted from the cells to the digital 
camera, and a computer means for receiving and processing the digital data from the 
digital camera. 
CELL SIZE AND AREA MARKERS 

Cytoskeletal Markers 

• ALEXA™ 488 phalloidin (Molecular Probes, Oregon) 
• Tubulin-green fluorescent protein chimeras 
• Cytokeratin-green fluorescent protein chimeras 
• Antibodies to cytoskeletal proteins 

Cytosolic Volume Markers 

• Green fluorescent proteins 
• Chloromethylfluorescein diacetate (CMFDA) 
• Calcein green 
• BCECF/AM ester 
• Rhodamine dextran 

Cell Surface Markers for Lipid, Protein, or Oligosaccharide 

• Dihexadecyl tetramethylindocarbocyanine perchlorate (DiIC16) lipid dyes 
• Triethylammonium propyl dibutylamino styryl pyridinium (FM 4-64, FM 1-43) lipid dyes 
• MITOTRACKER™ Green FM 
• Lectins to oligosaccharides such as fluorescein concanavalin A or wheat germ agglutinin 
• SYPRO™ Red non-specific protein markers 
• Antibodies to various surface proteins such as epidermal growth factor 
• Biotin labeling of surface proteins followed by fluorescent streptavidin labeling 

Protocols for cell staining with these various agents are well known to those 
skilled in the art. Cells are stained live or after fixation and the cell area can be 
measured. For example, live cells stained with DiIC16 have homogeneously labeled 
plasma membranes, and the projected cross-sectional area of the cell is uniformly 
discriminated from background by fluorescence intensity of the dye. Live cells stained 
with cytosolic stains such as CMFDA produce a fluorescence intensity that is 
proportional to cell thickness. Although cell labeling is dimmer in thin regions of the 
cell, total cell area can be discriminated from background. Fixed cells can be stained 
with cytoskeletal markers such as ALEXA™ 488 phalloidin that label polymerized 
actin. Phalloidin does not homogeneously stain the cytoplasm, but still permits 
discrimination of the total cell area from background. 

Cellular hypertrophy 

A screen to analyze cellular hypertrophy is implemented using the following 
strategy. Primary rat myocytes can be cultured in 96 well plates, treated with various 
compounds and then fixed and labeled with a fluorescent marker for the cell membrane 
or cytoplasm, or cytoskeleton, such as an antibody to a cell surface marker or a 
Additionally, one or more fluorescent antibodies to other cellular proteins, such 
as the major muscle proteins actin or myosin, can be included. Images of these 
additional labeled proteins can be acquired and stored with the above images, for later 
review, to identify anomalies in the distribution and morphology of these proteins in 
hypertrophic cells. This example of a multi-parametric screen allows for simultaneous 
analysis of cellular hypertrophy and changes in actin or myosin distribution. 

One of skill in the art will recognize that while the example analyzes myocyte 
hypertrophy, the methods can be applied to analyzing hypertrophy, or general 
morphological changes in any cell type. 

Cell morphology assays for prostate carcinoma 

Cell spreading is a measure of the response of cell surface receptors to substrate 
attachment ligands. Spreading is proportional to the ligand concentration or to the 
concentration of compounds that reduce receptor-ligand function. One example of 
selective cell-substrate attachment is prostate carcinoma cell adhesion to the 
extracellular matrix protein collagen. Prostate carcinoma cells metastasize to bone via 
selective adhesion to collagen. 

Compounds that interfere with metastasis of prostate carcinoma cells were 
screened as follows. PC3 human prostate carcinoma cells were cultured in media with 
appropriate stimulants and were passaged to collagen coated 96 well plates. Ligand 
concentration can be varied or inhibitors of cell spreading can be added to the wells. 
Examples of compounds that can affect spreading are receptor antagonists such as 
integrin- or proteoglycan-blocking antibodies, signaling inhibitors including 
phosphatidyl inositol-3 kinase inhibitors, and cytoskeletal inhibitors such as 
cytochalasin D. After two hours, cells were fixed and stained with ALEXA™ 488 
phalloidin (Molecular Probes) and Hoechst 33342 as per the protocol for cellular 
hypertrophy. The size of cells under these various conditions, as measured by 
cytoplasmic staining, can be distinguished above background levels. The number of 
cells per field is determined by measuring the number of nuclei stained with the 
Hoechst DNA dye. The area per cell is found by dividing the cytoplasmic area 
(phalloidin image) by the cell number (Hoechst image). The size of cells is 
proportional to the ligand-receptor function. Since the area is determined by ligand 
proximal nuclear location. This example illustrates how a high throughput screen can 
be coupled with a high-content screen in the dual mode System for Cell Based 
Screening. 

G-protein coupled receptors (GPCRs) are a large class of 7 trans-membrane domain cell 
surface receptors. Ligands for these receptors stimulate a cascade of secondary signals 
in the cell, which may include, but are not limited to, Ca²⁺ transients, cyclic AMP 
production, inositol triphosphate (IP3) production and phosphorylation. Each of these 
signals is rapid, occurring in a matter of seconds to minutes, but is also generic. For 
example, many different GPCRs produce a secondary Ca²⁺ signal when activated. 
Stimulation of a GPCR also results in the transport of that GPCR from the cell surface 
membrane to an internal, proximal nuclear compartment. This internalization is a much 
more receptor-specific indicator of activation of a particular receptor than are the 
secondary signals described above. 

Figure 19 illustrates a dual mode screen for activation of a GPCR. Cells 
carrying a stable chimera of the GPCR with a blue fluorescent protein (BFP) would be 
loaded with the acetoxymethylester form of Fluo-3, a cell permeable calcium indicator 
(green fluorescence) that is trapped in living cells by the hydrolysis of the esters. They 
would then be deposited into the wells of a microtiter plate 601. The wells would then 
be treated with an array of test compounds using a fluid delivery system, and a short 
sequence of Fluo-3 images of the whole microtiter plate would be acquired and 
analyzed for wells exhibiting a calcium response (i.e., high throughput mode). The 
images would appear like the illustration of the microtiter plate 601 in Figure 19. A 
small number of wells, such as wells C4 and E9 in the illustration, would fluoresce 
more brightly due to the Ca²⁺ released upon stimulation of the receptors. The locations 
of wells containing compounds that induced a response 602 would then be transferred 
to the HCS program and the optics switched for detailed cell by cell analysis of the blue 
fluorescence for evidence of GPCR translocation to the perinuclear region. The bottom 
of Figure 19 illustrates the two possible outcomes of the analysis of the high resolution 
cell data. The camera images a sub-region 604 of the well area 601, producing images 
of the fluorescent cells 605. In well C4, the uniform distribution of the fluorescence in 
the cells indicates that the receptor has not internalized, implying that the Ca²⁺ response 
Example 5 High-content screen of human glucocorticoid receptor translocation 

One class of HCS involves the drug-induced dynamic redistribution of 
intracellular constituents. The human glucocorticoid receptor (hGR), a single "sensor" 
in the complex environmental response machinery of the cell, binds steroid molecules 
that have diffused into the cell. The ligand-receptor complex translocates to the 
nucleus where transcriptional activation occurs (Htun et al., Proc. Natl. Acad. Sci. 
93:4845, 1996). 

In general, hormone receptors are excellent drug targets because their activity 
lies at the apex of key intracellular signaling pathways. Therefore, a high-content 
screen of hGR translocation has a distinct advantage over in vitro ligand-receptor binding 
assays. The availability of up to two more channels of fluorescence in the cell 
screening system of the present invention permits the screen to contain two additional 
parameters in parallel, such as other receptors, other distinct targets or other cellular 
processes. 

Plasmid construct. A eukaryotic expression plasmid containing a coding 
sequence for a green fluorescent protein - human glucocorticoid receptor (GFP-hGR) 
chimera was prepared using GFP mutants (Palm et al., Nat. Struct. Biol. 4:361 (1997)). 
The construct was used to transfect a human cervical carcinoma cell line (HeLa). 

Cell preparation and transfection. HeLa cells (ATCC CCL-2) were trypsinized 
and plated using DMEM containing 5% charcoal/dextran-treated fetal bovine serum 
(FBS) (HyClone) and 1% penicillin-streptomycin (C-DMEM) 12-24 hours prior to 
transfection and incubated at 37°C and 5% CO2. Transfections were performed by
calcium phosphate co-precipitation (Graham and Van der Eb, Virology 52:456, 1973; 
Sambrook et al., (1989). Molecular Cloning: A Laboratory Manual, Second ed. Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, 1989) or with Lipofectamine (Life 
Technologies, Gaithersburg, MD). For the calcium phosphate transfections, the 
medium was replaced, prior to transfection, with DMEM containing 5% 
charcoal/dextran-treated FBS. Cells were incubated with the calcium phosphate-DNA 
precipitate for 4-5 hours at 37°C and 5% CO2, washed 3-4 times with DMEM to
remove the precipitate, followed by the addition of C-DMEM. 

Lipofectamine transfections were performed in serum-free DMEM without 
antibiotics according to the manufacturer's instructions (Life Technologies, 
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schematic diagrams depicts the localization of GFP-hGR within the cell before 250 (A) 
and after 251 (B) stimulation with dexamethasone. Under these experimental 
conditions, the drug induces a large portion of the cytoplasmic GFP-hGR to translocate 
into the nucleus. This redistribution is quantified by determining the ratio of the
integrated intensities of the cytoplasmic and nuclear fluorescence in treated 255 and
untreated 254 cells. The lower pair of fluorescence micrographs shows the dynamic
redistribution of GFP-hGR in a single cell, before 254 and after 255 treatment. The
HCS is performed on wells containing hundreds to thousands of transfected cells and 
the translocation is quantified for each cell in the field exhibiting GFP fluorescence. 
Although the use of a stably transfected cell line would yield the most consistently
labeled cells, the heterogeneous levels of GFP-hGR expression induced by transient 
transfection did not interfere with analysis by the cell screening system of the present 
invention. 
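
The redistribution measure described above can be sketched in a few lines. This is a minimal illustration only, assuming that a nuclear mask and a whole-cell mask are already available for each cell; the function name `cyto_nuc_ratio`, the mask inputs, and the toy pixel values are hypothetical and are not part of the disclosed system.

```python
import numpy as np

def cyto_nuc_ratio(gfp, nuc_mask, cell_mask):
    """Ratio of integrated cytoplasmic to integrated nuclear GFP intensity.

    gfp       : 2-D array of GFP-channel pixel intensities
    nuc_mask  : boolean array, True inside the nucleus (assumed given)
    cell_mask : boolean array, True inside the whole cell (assumed given)
    """
    cyto_mask = cell_mask & ~nuc_mask          # cytoplasm = cell minus nucleus
    nuc_sum = float(gfp[nuc_mask].sum())
    cyto_sum = float(gfp[cyto_mask].sum())
    if nuc_sum == 0:
        raise ValueError("nuclear region has no signal")
    return cyto_sum / nuc_sum

# Toy 4x4 "cell" whose central 2x2 block is the nucleus (illustrative values).
cell = np.ones((4, 4), dtype=bool)
nuc = np.zeros((4, 4), dtype=bool)
nuc[1:3, 1:3] = True

untreated = np.full((4, 4), 10.0)              # signal spread evenly
treated = np.where(nuc, 40.0, 2.0)             # signal concentrated in the nucleus

ratio_untreated = cyto_nuc_ratio(untreated, nuc, cell)
ratio_treated = cyto_nuc_ratio(treated, nuc, cell)
```

In this toy case the ratio falls after treatment because signal moves from the cytoplasm into the nucleus, which is the direction of change the text describes for dexamethasone.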

To execute the screen, the cell screening system scans each well of the plate,
images a population of cells in each, and analyzes cells individually. Here, two
channels of fluorescence are used to define the cytoplasmic and nuclear distribution of 
the GFP-hGR within each cell. Depicted in Figure 21 is the graphical user interface of 
the cell screening system near the end of a GFP-hGR screen. The user interface depicts 
the parallel data collection and analysis capability of the system. The windows labeled
"Nucleus" 261 and "GFP-hGR" 262 show the pair of fluorescence images being
obtained and analyzed in a single field. The window labeled "Color Overlay" 260 is 
formed by pseudocoloring the above images and merging them so the user can 
immediately identify cellular changes. Within the "Stored Object Regions" window 
265, an image containing each analyzed cell and its neighbors is presented as it is
archived. Furthermore, as the HCS data are being collected, they are analyzed, in this
case for GFP-hGR translocation, and translated into an immediate "hit" response. The 
96 well plate depicted in the lower window of the screen 262 shows which wells have 
met a set of user-defined screening criteria. For example, a white-colored well 262 
indicates that the drug-induced translocation has exceeded a predetermined threshold
value of 50%. On the other hand, a black-colored well 270 indicates that the drug being
tested induced less than 10% translocation. Gray-colored wells 268 indicate "hits"
where the translocation value fell between 10% and 50%. Row "E" on the 96 well 
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Example 6 High-content screen of drug-induced apoptosis 

Apoptosis is a complex cellular program that involves myriad molecular events 
and pathways. To understand the mechanisms of drug action on this process, it is 
essential to measure as many of these events within cells as possible with temporal and
spatial resolution. Therefore, an apoptosis screen that requires little cell sample
preparation yet provides an automated readout of several apoptosis-related parameters
would be ideal. A cell-based assay designed for the cell screening system has been
used to simultaneously quantify several of the morphological, organellar, and
macromolecular hallmarks of paclitaxel-induced apoptosis.

Cell preparation. The cells chosen for this study were mouse connective tissue
fibroblasts (L-929; ATCC CCL-1) and a highly invasive glioblastoma cell line (SNB-19;
ATCC CRL-2219) (Welch et al., In Vitro Cell Dev. Biol. 31:610, 1995). The day
before treatment with an apoptosis-inducing drug, 3500 cells were placed into each well
of a 96-well plate and incubated overnight at 37°C in a humidified 5% CO2
atmosphere. The following day, the culture medium was removed from each well and
replaced with fresh medium containing various concentrations of paclitaxel (0 - 50
µM) from a 20 mM stock made in DMSO. The maximal concentration of DMSO used
in these experiments was 0.25%. The cells were then incubated for 26 h as above. At
the end of the paclitaxel treatment period, each well received fresh medium containing
750 nM MitoTracker Red (Molecular Probes; Eugene, OR) and 3 µg/ml Hoechst 33342
DNA-binding dye (Molecular Probes) and was incubated as above for 20 min. Each
well on the plate was then washed with HBSS and fixed with 3.7% formaldehyde in
HBSS for 15 min at room temperature. The formaldehyde was washed out with HBSS
and the cells were permeabilized for 90 s with 0.5% (v/v) Triton X-100, washed with
HBSS, incubated with 2 U/ml Bodipy FL phallacidin (Molecular Probes) for 30 min,
and washed with HBSS. The wells on the plate were then filled with 200 µl HBSS,
sealed, and the plate stored at 4°C if necessary. The fluorescence signals from plates
stored this way were stable for at least two weeks after preparation. As in the nuclear
translocation assay, fluorescence reagents can be designed to convert this assay into a
live cell high-content screen.
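
The stated DMSO ceiling follows directly from the dilution of the stock. The short calculation below is a worked check, assuming the drug is diluted straight from the 20 mM DMSO stock into medium with no intermediate aqueous dilution; the function name `dmso_percent` is introduced here for illustration only.

```python
def dmso_percent(final_drug_molar, stock_molar):
    """Percent (v/v) of stock solvent carried into the medium by direct dilution."""
    if stock_molar <= 0:
        raise ValueError("stock concentration must be positive")
    return 100.0 * final_drug_molar / stock_molar

# Highest paclitaxel dose in the text: 50 uM, taken from a 20 mM stock in DMSO.
top_dose_dmso = dmso_percent(50e-6, 20e-3)
print(f"DMSO at the top dose: {top_dose_dmso:.2f}% (v/v)")
```

A 50 µM final concentration from a 20 mM stock is a 1:400 dilution, which corresponds to the 0.25% DMSO quoted in the text.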

Image acquisition and analysis on the ArrayScan System. The fluorescence 
intensity of intracellular MitoTracker Red, Hoechst 33342, and Bodipy FL phallacidin 
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action. For example, the area, brightness, and fragmentation of the nucleus 298 and 
actin polymerization values 294 reached a maximum value when SNB-19 cells were 
treated with 10 nM paclitaxel (Figure 24; top and bottom graphs). However, 
mitochondrial potential 295 was minimal at the same concentration of paclitaxel 
(Figure 24; middle graph). The fact that all the parameters measured approached 
control levels at increasing paclitaxel concentrations (>10 nM) suggests that SNB-19 
cells have low affinity drug metabolic or clearance pathways that are compensatory at 
sufficiently high levels of the drug. Contrasting the drug sensitivity of SNB-19 cells 
297, L-929 showed a different response to paclitaxel 296. These fibroblastic cells 
showed a maximal response in many parameters at 5 µM paclitaxel, a 500-fold higher
dose than SNB-19 cells. Furthermore, the L-929 cells did not show a sharp decrease in 
mitochondrial potential 295 at any of the paclitaxel concentrations tested. This result is 
consistent with the presence of distinct apoptosis pathways in a normal and a
cancer cell line. Therefore, these results indicate that a relatively simple fluorescence
labeling protocol can be coupled with the cell screening system of the present invention 
to produce a high-content screen of key events involved in programmed cell death. 

Background 

A key to the mechanism of apoptosis was the discovery that, irrespective of the
lethal stimulus, death results in identical apoptotic morphology that includes cell and
organelle dismantling and repackaging, DNA cleavage to nucleosome sized fragments,
and engulfment of the fragmented cell to avoid an inflammatory response. Apoptosis is
therefore distinct from necrosis, which is mediated more by acute trauma to a cell, 
resulting in spillage of potentially toxic and antigenic cellular components into the 
intercellular milieu, leading to an inflammatory response. 

The criteria for determining whether a cell is undergoing apoptosis (Wyllie et 
al. 1980. Int Rev Cytol. 68:251-306; Thompson, 1995. Science. 267:1456-62; Majno 
and Joris. 1995. Am J Pathol. 146:3-15; Allen et al. 1998. Cell Mol Life Sci. 54:427-45)
include distinct morphological changes in the appearance of the cell, as well as 
alterations in biochemical and molecular markers. For example, apoptotic cells often 
undergo cytoplasmic membrane blebbing, their chromosomes rapidly condense and 
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Nuclear condensation has been reported in some cell types, such as MCF-7 
(Saunders et al. 1997. Int J Cancer. 70:214-20). Condensation appears to arise as a 
consequence of the loss of structural integrity of the euchromatin, nuclear matrix and 
nuclear lamina (Hendzel et al. 1998. J Biol Chem. 273:24470-8). During nuclear 
condensation, the chromatin concentrates near the margin of the nucleus, leading to the
overall shrinkage of the nucleus. Thus, the use of nuclear morphology as a measure of 
apoptosis must take both condensation and fragmentation into account. 

Material and Methods 

Cells were plated into 96-well plates at densities of 3 x 10^3 to 1 x 10^4 cells/well.
The following day apoptotic inducers were added at indicated concentrations and cells 
were incubated for indicated time periods (usually 16-30 hours). The next day medium 
was removed and cells were stained with 5 ng/ml Hoechst (Molecular Probes, Inc.) in 
fresh medium and incubated for 30 minutes at 37°C. Cells were washed in Hank's
Balanced Salt Solution (HBSS) and fixed with 3.7% formaldehyde in HBSS at room
temperature. Cells were washed 2X with HBSS at room temperature and the plate was 
sealed. 

Quantitation of changes in nuclear morphology upon induction of apoptosis was 
accomplished by (1) measuring the effective size of the nuclear region; and (2) 
measuring the degree of convolution of the perimeter. The size parameter provides the
more sensitive measure of nuclear condensation, whereas the perimeter measure 
provides a more sensitive measure of nuclear fragmentation. 
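
The two morphology measures can be illustrated on a binary nuclear mask. The sketch below is not the disclosed algorithm: it assumes the nucleus has already been segmented from the Hoechst image, takes "effective size" to be pixel area, and uses perimeter squared over 4π times area as one possible measure of perimeter convolution. The function name `nuclear_shape` is hypothetical.

```python
import math
import numpy as np

def nuclear_shape(mask):
    """Return (area, perimeter, convolution) for a boolean nuclear mask.

    area        : number of foreground pixels
    perimeter   : count of pixel edges separating foreground from background
    convolution : perimeter**2 / (4 * pi * area); grows as the outline
                  becomes more convoluted or the object breaks into pieces
    """
    # Pad with background so that np.roll never wraps foreground pixels around.
    m = np.pad(np.asarray(mask, dtype=bool), 1, constant_values=False)
    area = int(m.sum())
    if area == 0:
        raise ValueError("mask contains no nuclear pixels")
    perimeter = 0
    for axis in (0, 1):
        for shift in (1, -1):
            neighbor = np.roll(m, shift, axis=axis)
            perimeter += int((m & ~neighbor).sum())
    convolution = perimeter ** 2 / (4 * math.pi * area)
    return area, perimeter, convolution

# Toy masks: an intact nucleus, a condensed one, and a fragmented one.
intact = np.zeros((8, 8), dtype=bool)
intact[2:6, 2:6] = True
condensed = np.zeros((8, 8), dtype=bool)
condensed[3:5, 3:5] = True
fragmented = np.zeros((8, 8), dtype=bool)
fragmented[1:3, 1:3] = True
fragmented[5:7, 5:7] = True

shape_intact = nuclear_shape(intact)
shape_condensed = nuclear_shape(condensed)
shape_fragmented = nuclear_shape(fragmented)
```

In these toy masks, condensation shows up as a drop in area with an unchanged convolution value, while fragmentation raises the convolution value, matching the division of labor between the two measures described above.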

Results & Discussion 

L929 cells responded to both staurosporine (30 hours) and paclitaxel (30 hours)
with a dose-dependent change in nuclear morphology (Fig. 25A and 25B). BHK cells
showed a slightly more complicated, yet clearly visible response. Staurosporine
appeared to stimulate nuclear condensation at lower doses and nuclear fragmentation at
higher doses (Fig. 25C and 25D). In contrast, paclitaxel induced a consistent increase in
nuclear fragmentation with increasing concentrations. The response of MCF-7 cells
varied dramatically depending upon the apoptotic inducer. Staurosporine appeared to 
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the Hoechst stain. Derivation was accomplished by combinations of erosions and 
dilations. 
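
A combination of erosions and dilations can be sketched as follows. The surrounding text is incomplete at this point, so this example assumes that the region being derived is a ring-shaped cytoplasmic mask built around a Hoechst-defined nuclear mask; the function names, the 3x3 structuring element, and the `gap` and `width` parameters are all illustrative choices, not details taken from the source.

```python
import numpy as np

def dilate(mask):
    """One step of binary dilation with a 3x3 square structuring element."""
    h, w = mask.shape
    m = np.pad(mask, 1, constant_values=False)
    out = np.zeros((h, w), dtype=bool)
    for dy in range(3):
        for dx in range(3):
            out |= m[dy:dy + h, dx:dx + w]
    return out

def erode(mask):
    """One step of binary erosion; pixels outside the image count as background."""
    h, w = mask.shape
    m = np.pad(mask, 1, constant_values=False)
    out = np.ones((h, w), dtype=bool)
    for dy in range(3):
        for dx in range(3):
            out &= m[dy:dy + h, dx:dx + w]
    return out

def derive_masks(nuc_mask, gap=1, width=2):
    """Derive a conservative nuclear core and a surrounding ring (assumed scheme)."""
    core = erode(nuc_mask)                 # shrink to stay well inside the nucleus
    inner = nuc_mask
    for _ in range(gap):                   # leave a gap around the nuclear edge
        inner = dilate(inner)
    outer = inner
    for _ in range(width):                 # grow outward to form the ring
        outer = dilate(outer)
    ring = outer & ~inner
    return core, ring

nuc = np.zeros((11, 11), dtype=bool)
nuc[4:7, 4:7] = True                       # toy 3x3 nucleus
core, ring = derive_masks(nuc)
```

Intensities from a second channel, such as an f-actin label, could then be summed under `ring` or `core`; that use is likewise an assumption about how such masks would be applied.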

Results and Discussion 
Changes in f-actin content varied based on cell type and apoptotic inducer (Fig.
27). Staurosporine (30 hours) induced increases in f-actin in L929 (Fig. 27A) and BHK
(Fig. 27B) cells. MCF-7 cells exhibited a concentration-dependent response. At low
concentrations (Fig. 27E) there appeared to be a decrease in f-actin content. At higher
concentrations, f-actin content increased. Paclitaxel (30 hours) treatment led to a wide
variety of responses. L929 cells responded with graded increases in f-actin (Fig. 27B)
whereas both BHK and MCF-7 responses were highly variable (Figs. 27D & 27F, 
respectively). 

Result of Evaluation: Both increases and decreases in signal intensity were 
measured for several cell lines and found to exhibit a concentration-dependent
response. For certain cell line/apoptotic inducer pairs this could be a statistically 
significant apoptotic indicator. 

Changes in Mitochondrial Mass/Potential 
Introduction

Changes in mitochondria play a central role in apoptosis (Henkart and
Grinstein. 1996. J Exp Med. 183:1293-5). Mitochondria release apoptogenic factors
through the outer membrane and dissipate the electrochemical gradient of the inner
membrane. This is thought to occur via formation of the mitochondrial permeability
transition (MPT), although it is apparently not true in all cases. An obvious
manifestation of the formation of the MPT is collapse of the mitochondrial membrane
potential. Inhibition of MPT by pharmacological intervention or mitochondrial
expression of the anti-apoptotic protein Bcl-2 prevents cell death, suggesting the
formation of the MPT may be a rate-limiting event of the death process (for review
see: Kroemer et al. 1998. Annu Rev Physiol 60:619-42). It has also been observed that
mitochondria can proliferate during stimulation of apoptosis (Mancini et al. 1997. J 
Cell Biol 138:449-69; Camilleri-Broet et al. 1998. Exp Cell Res. 239:277-92). 
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treated with 200 nM Mitotracker Green and 200 nM Mitotracker Red for 0.5 hours 
before fixation. 

Results & Discussion 

Induction of apoptosis by staurosporine and paclitaxel led to varying
mitochondrial changes depending upon the stimulus. L929 cells exhibited a clear
increase in mitochondrial mass with increasing staurosporine concentrations (Fig. 28).
BHK cells exhibited either a decrease in membrane potential at lower concentrations of
staurosporine, or an increase in mass at higher concentrations of staurosporine (Fig.
28C). MCF-7 cells responded by a consistent decrease in mitochondrial membrane
potential in response to increasing concentrations of staurosporine (Fig. 28E).
Increasing concentrations of paclitaxel caused consistent increases in mitochondrial
mass (Fig. 28B, 28D, and 28F).

The mitochondrial membrane potential is measured by labeling mitochondria
with both Mitotracker Green FM and Mitotracker Red (Molecular Probes, Inc).
Mitotracker Red labeling is proportional to both mass and membrane potential.
Mitotracker Green FM labeling is proportional to mass. The ratio of Mitotracker Red
signal to the Mitotracker Green FM signal provides a measure of mitochondrial
membrane potential (Poot and Pierce, 1999). This ratio normalizes the Mitotracker Red
signal with respect to mitochondrial mass (see Figure 28G). Combining the
ability to normalize to mitochondrial mass with a measure of the membrane potential
allows independent assessment of both parameters.
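
The normalization described above reduces to a simple ratio. The sketch below assumes that per-cell integrated intensities for the two dyes have already been measured; the function name `mito_readouts` and the example intensity values are hypothetical.

```python
def mito_readouts(red_intensity, green_intensity):
    """Return (mass, potential) from per-cell Mitotracker intensities.

    mass      : Mitotracker Green FM signal (proportional to mass)
    potential : Mitotracker Red / Mitotracker Green FM, i.e. the red signal
                normalized to mitochondrial mass
    """
    if green_intensity <= 0:
        raise ValueError("Mitotracker Green signal must be positive")
    return green_intensity, red_intensity / green_intensity

# Illustrative values only (arbitrary intensity units).
control = mito_readouts(red_intensity=1000.0, green_intensity=500.0)
more_mass = mito_readouts(red_intensity=2000.0, green_intensity=1000.0)
depolarized = mito_readouts(red_intensity=500.0, green_intensity=500.0)
```

In the toy values, doubling the mitochondrial mass doubles both signals and leaves the ratio unchanged, whereas a loss of potential lowers the ratio at constant mass, which is why the two parameters can be assessed independently.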

Result of Evaluation: Both decreases in potential and increases in mass were observed
depending on the cell line and inducer tested. Dose-dependent correlation demonstrates
that this is a promising apoptotic indicator.

It is possible to combine multiple measures of apoptosis by exploiting the
spectral domain of fluorescence spectroscopy. In fact, all of the nuclear morphology/f-actin
content/mitochondrial mass/mitochondrial potential data shown earlier were
collected as multiparameter assays, but were presented individually for clarity.
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Caspase-GFP is calculated by dividing the integrated fluorescence intensity of Caspase- 
GFP in the nucleus by the integrated fluorescence intensity of the chimera in the 
cytoplasm or as a nuclear-cytoplasmic difference of GFP fluorescence. In the fixed 
time point screen this translocation ratio is calculated from data obtained from at least
200 cells at each concentration of compound tested. Drug-induced translocation of
Caspase-GFP from the cytoplasm to the nucleus is therefore correlated with an increase
in the translocation ratio. Molecular interaction libraries including, but not limited to,
those comprising putative activators or inhibitors of apoptosis-activated enzymes are
used to screen the indicator cell lines and identify a specific ligand for the DAS, and a
pathway activated by compound activity.
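
The two translocation measures and the 200-cell minimum can be expressed compactly. This is a sketch under stated assumptions: per-cell integrated nuclear and cytoplasmic intensities are taken as given, and the per-cell values are summarized by their mean, which the text does not specify. All function names are hypothetical.

```python
from statistics import mean

MIN_CELLS = 200  # the text calls for at least 200 cells per concentration

def translocation_ratio(nuclear, cytoplasmic):
    """Integrated nuclear intensity divided by integrated cytoplasmic intensity."""
    return nuclear / cytoplasmic

def translocation_difference(nuclear, cytoplasmic):
    """Alternative measure: nuclear minus cytoplasmic intensity."""
    return nuclear - cytoplasmic

def well_translocation(cells, min_cells=MIN_CELLS):
    """Mean translocation ratio over (nuclear, cytoplasmic) pairs for one dose."""
    if len(cells) < min_cells:
        raise ValueError(f"need at least {min_cells} cells, got {len(cells)}")
    return mean(translocation_ratio(n, c) for n, c in cells)

# Illustrative per-cell intensities (arbitrary units).
untreated_cells = [(50.0, 100.0)] * 200
treated_cells = [(150.0, 100.0)] * 200

untreated_value = well_translocation(untreated_cells)
treated_value = well_translocation(treated_cells)
```

An increase from the untreated to the treated value corresponds to movement of the chimera from the cytoplasm to the nucleus, as stated above.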

Example 5. Identification of novel steroid receptors from DAS 

Two sources of material and/or information are required to make use of this 
embodiment, which allows assessment of the function of an uncharacterized gene.
First, disease associated sequence bank(s) containing cDNA sequences suitable for
transfection into mammalian cells can be used. Because every RADE or differential
expression experiment generates up to several hundred sequences, it is possible to
generate an ample supply of DAS. Second, information from primary sequence
database searches can be used to place DAS into broad categories, including, but not
limited to, those that contain signal sequences, seven trans-membrane motifs,
conserved protease active site domains, or other identifiable motifs. Based on the
information acquired from these sources, method types and indicator cell lines to be
transfected are selected. A large number of motifs are already well characterized and
encoded in the linear sequences contained within the large number of genes in existing
genomic databases.

In one embodiment, the following steps are taken: 

1) Information from the DAS identification experiment (including database
searches) is used as the basis for selecting the relevant biological processes (for
example, look at the DAS from a tumor line for cell cycle modulation, apoptosis,
metastatic proteases, etc.).

2) Sorting of DNA sequences or DAS by identifiable motifs (i.e., signal
sequences, 7-transmembrane domains, conserved protease active site domains, etc.).
This initial grouping will determine fluorescent tagging strategies, host cell lines, 
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Cell preparation and transfection. HeLa cells are trypsinized and plated using 
DMEM containing 5% charcoal/dextran-treated fetal bovine serum (FBS) (HyClone)
and 1% penicillin-streptomycin (C-DMEM) 12-24 hours prior to transfection and
incubated at 37°C and 5% CO2. Transfections are performed by calcium phosphate
coprecipitation or with Lipofectamine (Life Technologies). For the calcium phosphate
transfections, the medium is replaced, prior to transfection, with DMEM containing 5%
charcoal/dextran-treated FBS. Cells are incubated with the calcium phosphate-DNA
precipitate for 4-5 hours at 37°C and 5% CO2, and washed 3-4 times with DMEM to
remove the precipitate, followed by the addition of C-DMEM. Lipofectamine
transfections are performed in serum-free DMEM without antibiotics according to the
manufacturer's instructions. Following a 2-3 hour incubation with the DNA-liposome
complexes, the medium is removed and replaced with C-DMEM. All transfected cells
in 96-well microtiter plates are incubated at 33°C and 5% CO2 for 24-48 hours prior to
drug treatment. Experiments are performed with the receptor expressed transiently in
HeLa cells.

Localization of expressed GFP-DASpp inside cells. To obtain cellular 
distribution data, nuclei of transfected cells are first labeled with 5 µg/ml Hoechst
33342 (Molecular Probes) in C-DMEM for 20 minutes at 33°C and 5% CO2. Cells are
washed once in Hank's Balanced Salt Solution (HBSS). The cells are analyzed live or
they are rinsed with HBSS, fixed for 15 min with 3.7% formaldehyde in HBSS, stained
with Hoechst 33342, and washed before analysis. 

In a preferred embodiment, image acquisition and analysis are performed using 
the cell screening system of the present invention. The intracellular GFP-DASpp 
fluorescence signal is collected by acquiring fluorescence image pairs (GFP-DASpp 
and Hoechst 33342-labeled nuclei) from fields of cells. The image pairs obtained at each
time point are used to define nuclear and cytoplasmic regions in each cell. Data 
demonstrating dispersed signal in the cytoplasm would be consistent with known 
steroid receptors that are DNA transcriptional activators. 

Screening for induction of GFP-DASpp translocation. Using the above 
construct, confirmed for appropriate expression of the GFP-DASpp, as an indicator cell
line, a screen of various ligands is performed using a series of steroid type ligands 
including, but not limited to: estrogen, progesterone, retinoids, growth factors, 
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Methods in Enzymology 256:41-49) with antibodies labeled with a fourth color. Each 
of the four labels is imaged separately using the cell screening system, and the images 
used to calculate the amount of inhibition or activation of translocation effected by the 
test compound. To do this calculation, the images of the probes used to mark the 
plasma membrane and cytoplasm are used to mask the image of the immunological
probe marking the location of intracellular Rho protein. The integrated brightness per
unit area under each mask is used to form a translocation quotient by dividing the
plasma membrane integrated brightness/area by the cytoplasmic integrated
brightness/area. By comparing the translocation quotient values from control and
experimental wells, the percent translocation is calculated for each potential lead
compound. 
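
The translocation quotient and its comparison against control wells can be sketched as below. The quotient follows the description above; the percent translocation formula is an assumption (percent change of the quotient relative to control), because the text does not give the exact expression. Function names and numerical values are illustrative.

```python
def translocation_quotient(membrane_sum, membrane_area, cyto_sum, cyto_area):
    """Plasma membrane brightness per unit area over cytoplasmic brightness per unit area."""
    membrane_density = membrane_sum / membrane_area
    cyto_density = cyto_sum / cyto_area
    return membrane_density / cyto_density

def percent_translocation(q_experimental, q_control):
    """Percent change of the quotient relative to control (assumed definition)."""
    return 100.0 * (q_experimental - q_control) / q_control

# Illustrative integrated brightness values and mask areas (pixels).
q_control = translocation_quotient(200.0, 100, 1600.0, 400)       # 2.0 / 4.0
q_experimental = translocation_quotient(600.0, 100, 1200.0, 400)  # 6.0 / 3.0
pct = percent_translocation(q_experimental, q_control)
```

Dividing by mask area before forming the quotient keeps the measure comparable between cells whose membrane and cytoplasmic regions differ in size.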

β-Arrestin translocation to the plasma membrane upon G-protein receptor activation.
In another embodiment of a cytoplasm to membrane translocation high-content
screen, the translocation of β-arrestin protein from the cytoplasm to the plasma
membrane is measured in response to cell treatment. To measure the translocation,
living indicator cells containing luminescent domain markers are treated with test
compounds and the movement of the β-arrestin marker is measured in time and space
using the cell screening system of the present invention. In a preferred embodiment,
the indicator cells contain luminescent markers consisting of a green fluorescent protein
β-arrestin (GFP-β-arrestin) protein chimera (Barak et al. (1997), J. Biol. Chem.
272:27497-27500; Daaka et al. (1998), J. Biol. Chem. 273:685-688) that is expressed
by the indicator cells through the use of transient or stable cell transfection and other
reporters used to mark cytoplasmic and membrane domains. When the indicator cells
are in the resting state, the domain marker molecules partition predominantly in the
plasma membrane or in the cytoplasm. In the high-content screen, these markers are
used to delineate the cell cytoplasm and plasma membrane in distinct channels of
fluorescence. When the indicator cells are treated with a test compound, the dynamic
redistribution of the GFP-β-arrestin is recorded as a series of images over a time scale
ranging from 0.1 s to 10 h. In a preferred embodiment, the time scale is 1 h. Each
image is analyzed by a method that quantifies the movement of the GFP-β-arrestin
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10 



the probes used to mark the endoplasmic reticulum and the Golgi domains are used to 
mask the image of the GFP-VSVG probe marking the location of intracellular GFP- 
VSVG protein. The integrated brightness per unit area under each mask is used to form 
a translocation quotient by dividing the endoplasmic reticulum integrated 
brightness/area by the Golgi integrated brightness/area. By comparing the translocation 
quotient values from control and experimental wells, the percent translocation is 
calculated for each potential lead compound. The output of the high-content screen 
relates quantitative data describing the magnitude of the translocation within a large 
number of individual cells that have been treated with test compounds of interest at 
final concentrations ranging from 10^-12 M to 10^-3 M for a period ranging from 1 min to
10 h. 



Induction and inhibition of organellar function: 
Intracellular microtubule stability. 

In another aspect of the invention, an automated method for identifying
compounds that modify microtubule structure is provided. In this embodiment,
indicator cells are treated with test compounds and the distribution of luminescent
microtubule-labeling molecules is measured in space and time using a cell screening
system, such as the one disclosed above. The luminescent microtubule-labeling
molecules may be expressed by or added to the cells either before, together with, or
after contacting the cells with a test compound.

In one embodiment of this aspect of the invention, living cells express a 
luminescently labeled protein biosensor of microtubule dynamics, comprising a protein 
that labels microtubules fused to a luminescent protein. Appropriate microtubule-labeling
proteins for this aspect of the invention include, but are not limited to, α and β
tubulin isoforms, and MAP4. Preferred embodiments of the luminescent protein
include, but are not limited to, green fluorescent protein (GFP) and GFP mutants. In a
preferred embodiment, the method involves transfecting cells with a microtubule
labeling luminescent protein, wherein the microtubule labeling protein can be, but is
not limited to, α-tubulin, β-tubulin, or microtubule-associated protein 4 (MAP4). The
approach outlined here enables those skilled in the art to make live cell measurements 
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A variety of GFP mutants are available, all of which would be effective in this 
invention, including, but not limited to, GFP mutants which are commercially available 
(Clontech, California). 

The MAP4 construct has been introduced into several mammalian cell lines 
(BHK-21, Swiss 3T3, HeLa, HEK 293, LLCPK) and the organization and localization
of tubulin has been visualized in live cells by virtue of the GFP fluorescence as an 
indicator of MAP4 localization. The construct can be expressed transiently or stable 
cell lines can be prepared by standard methods. Stable HeLa cell lines expressing the 
EGFP-MAP4 chimera have been obtained, indicating that expression of the chimera is
not toxic and does not interfere with mitosis.

Possible selectable markers for establishment and maintenance of stable cell 
lines include, but are not limited to, the neomycin resistance gene, hygromycin
resistance gene, zeocin resistance gene, puromycin resistance gene, bleomycin
resistance gene, and blasticidin resistance gene.

The utility of this method for the monitoring of microtubule assembly,
disassembly, and rearrangement has been demonstrated by treatment of transiently and
stably transfected cells with microtubule drugs such as paclitaxel, nocodazole, 
vincristine, or vinblastine. 

The present method provides high-content and combined high throughput-high
content cell-based screens for anti-microtubule drugs, particularly as one parameter in a
multi-parametric cancer target screen. The EGFP-MAP4 construct used herein can also
be used as one of the components of a high-content screen that measures multiple
signaling pathways or physiological events. In a preferred embodiment, a combined
high throughput and high content screen is employed, wherein multiple cells in each of
the locations containing cells are analyzed in a high throughput mode, and only a subset
of the locations containing cells are analyzed in a high content mode. The high
throughput screen can be any screen that would be useful to identify those locations
containing cells that should be further analyzed, including, but not limited to,
identifying locations with increased luminescence intensity, those exhibiting
expression of a reporter gene, those undergoing calcium changes, and those
undergoing pH changes.
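
The combined mode amounts to a two-pass procedure: a fast whole-well measurement on every location, followed by a detailed cell-by-cell measurement on only the locations that pass. The sketch below illustrates that control flow; the function name, the callables `hts_measure` and `hcs_measure`, the threshold, and the toy data are all hypothetical stand-ins for the instrument's actual measurements.

```python
def combined_screen(wells, hts_measure, hcs_measure, hts_threshold):
    """Run a high throughput pass on all wells, then a high content pass on hits.

    wells         : dict mapping well ID to that well's data
    hts_measure   : callable giving one whole-well value (e.g. total intensity)
    hcs_measure   : callable giving a detailed per-well readout
    hts_threshold : wells at or above this HTS value go on to the HCS pass
    """
    hts_results = {wid: hts_measure(data) for wid, data in wells.items()}
    selected = [wid for wid, value in hts_results.items() if value >= hts_threshold]
    hcs_results = {wid: hcs_measure(wells[wid]) for wid in selected}
    return hts_results, hcs_results

# Toy plate: each well holds a list of per-cell intensities (arbitrary units).
plate = {"A1": [10, 12, 11], "A2": [40, 55, 48], "A3": [9, 8, 10]}

hts, hcs = combined_screen(
    plate,
    hts_measure=sum,                                   # whole-well intensity
    hcs_measure=lambda cells: max(cells) - min(cells), # cell-to-cell spread
    hts_threshold=100,
)
```

Only well A2 passes the high throughput threshold in this example, so the more expensive per-cell analysis runs on one well out of three.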
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3. A classifier to quantify microtubule depolymerization using a measure 
of image texture. 

4. A classifier to quantify apparent interconnectivity, or branching (or
both), of the microtubules.

5. Measurement of the kinetics of microtubule reorganization using the 
above classifiers on a time series of images of cells treated with test compounds. 
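
A texture-based classifier and its use on a time series can be illustrated as follows. The source does not state which texture measure is used, so this sketch substitutes a simple one, the coefficient of variation of pixel intensity within a cell region, on the assumption that filamentous labeling gives high contrast and diffuse labeling after depolymerization gives low contrast. The function names are hypothetical.

```python
import numpy as np

def texture_score(image):
    """Coefficient of variation of pixel intensity (std / mean) in a cell region."""
    img = np.asarray(image, dtype=float)
    mean_intensity = img.mean()
    if mean_intensity == 0:
        raise ValueError("region has no signal")
    return float(img.std() / mean_intensity)

def texture_kinetics(frames):
    """Apply the texture score to each frame of a time series."""
    return [texture_score(frame) for frame in frames]

# Toy frames: bright stripes (filament-like), partly blurred, then uniform.
filamentous = np.tile([0.0, 100.0], (4, 2))
intermediate = np.tile([25.0, 75.0], (4, 2))
diffuse = np.full((4, 4), 50.0)

scores = texture_kinetics([filamentous, intermediate, diffuse])
```

The three toy frames have the same mean intensity, so the falling score reflects only the loss of spatial structure, which is the property a depolymerization classifier would need to track over time.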

In a further aspect, kits are provided for analyzing microtubule stability, 
comprising an expression vector comprising a nucleic acid that encodes a microtubule 
labeling protein and instructions for using the expression vector for carrying out the 
methods described above. In a preferred embodiment, the expression vector further
comprises a nucleic acid that encodes a luminescent protein, wherein the microtubule
binding protein and the luminescent protein thereof are expressed as a fusion protein.
Alternatively, the kit may contain an antibody that specifically binds to the
microtubule-labeling protein. In a further embodiment, the kit includes cells that
express the microtubule labeling protein. In a preferred embodiment, the cells are
transfected with the expression vector. In another preferred embodiment, the kits
further contain a compound that is known to disrupt microtubule structure, including
but not limited to curacin, nocodazole, vincristine, or vinblastine. In another preferred
embodiment, the kits further comprise a compound that is known to stabilize
microtubule structure, including but not limited to taxol (paclitaxel) and
discodermolide.

In another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instructions for causing a cell 
screening system to execute the disclosed methods for analyzing microtubule stability, 
wherein the cell screening system comprises an optical system with a stage adapted for
holding a plate containing cells, a digital camera, a means for directing fluorescence or
luminescence emitted from the cells to the digital camera, and a computer means for 
receiving and processing the digital data from the digital camera. 
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At high fractional values of phosphorylation, PFK-2 stimulates carbohydrate 
anabolism. 

Protein kinase A activity and localization of subunits. In another 
embodiment of a high-content screen, both the domain localization and activity of
protein kinase A (PKA) within indicator cells are measured in response to treatment 
with test compounds. 

The indicator cells contain luminescent reporters including a fluorescent protein 
biosensor of PKA activation. The fluorescent protein biosensor is constructed by
introducing an environmentally sensitive fluorescent dye into the catalytic subunit of
PKA near the site known to interact with the regulatory subunit of PKA (Harootunian
et al. (1993), Mol Biol of the Cell 4:993-1002; Johnson et al. (1996), Cell 85:149-158;
Giuliano et al. (1995), supra). The dye can be of the ketocyanine class (Kessler and
Wolfbeis (1991), Spectrochimica Acta 47A:187-192) or any class that contains a
protein reactive moiety and a fluorochrome whose excitation or emission spectrum is
sensitive to solution polarity. The fluorescent protein biosensor of PKA activation is 
introduced into the indicator cells using bulk loading methodology. 

In one embodiment, living indicator cells are treated with test compounds, at 
final concentrations ranging from 10^-12 M to 10^-3 M for times ranging from 0.1 s to 10
h. In a preferred embodiment, ratio image data are obtained from living treated
indicator cells. To extract biosensor data from each time point, a ratio is made between
each pair of images, and each pixel value is then used to calculate the fractional
activation of PKA (e.g., separation of the catalytic and regulatory subunits after cAMP
binding). At high fractional values of activity, PFK-2 stimulates biochemical cascades
within the living cell.
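
The per-pixel ratio step can be sketched as below. The conversion from ratio to fractional activation is not specified in the text, so this example assumes a linear scaling between two calibration ratios, `r_min` (fully inactive) and `r_max` (fully active); those parameters, the function name, and the toy images are hypothetical.

```python
import numpy as np

def fractional_activation(img_a, img_b, r_min, r_max):
    """Per-pixel ratio of two images, rescaled to a 0..1 fractional activation.

    img_a, img_b : the pair of images acquired at one time point
    r_min, r_max : calibration ratios for 0% and 100% activation (assumed known)
    """
    a = np.asarray(img_a, dtype=float)
    b = np.asarray(img_b, dtype=float)
    if np.any(b == 0):
        raise ValueError("denominator image contains zero-valued pixels")
    if r_max <= r_min:
        raise ValueError("r_max must exceed r_min")
    ratio = a / b
    fraction = (ratio - r_min) / (r_max - r_min)
    return np.clip(fraction, 0.0, 1.0)   # keep values within the calibrated range

# Toy 2x2 image pair (arbitrary units).
numerator = np.array([[2.0, 4.0], [6.0, 1.0]])
denominator = np.full((2, 2), 2.0)
activation = fractional_activation(numerator, denominator, r_min=1.0, r_max=3.0)
```

Taking the ratio pixel by pixel cancels differences in local probe concentration and path length, which is the usual motivation for ratio imaging.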

To measure the translocation of the catalytic subunit of PKA, indicator cells
containing luminescent reporters are treated with test compounds and the movement of
the reporters is measured in space and time using the cell screening system. The
indicator cells contain luminescent reporters consisting of domain markers used to
measure the localization of the cytoplasmic and nuclear domains. When the indicator
cells are treated with test compounds, the dynamic redistribution of a PKA
fluorescent protein biosensor is recorded intracellularly as a series of images over a

portion of the message coding for β-actin (Kislauskis et al. (1994), J. Cell Biol.
127:441-451; McCann et al. (1997), Proc. Natl. Acad. Sci. 94:5679-5684; Sutoh
(1982), Biochemistry 21:3654-3661) is inserted into the loop region of a hairpin-shaped
oligonucleotide with the ends tethered together due to intramolecular hybridization. At
each end of the biosensor a fluorescence donor (fluorescein) and a fluorescence
acceptor (rhodamine) are covalently bound. In the tethered state, the fluorescence
energy transfer is maximal and therefore indicative of an unhybridized molecule.
When hybridized with the mRNA coding for β-actin, the tether is broken and energy
transfer is lost. The complete fluorescent biosensor is introduced into the indicator
cells using bulk loading methodology.

In one embodiment, living indicator cells are treated with test compounds, at
final concentrations ranging from 10^-12 M to 10^-3 M for times ranging from 0.1 s to 10
h. In a preferred embodiment, ratio image data are obtained from living treated
indicator cells. To extract morphometric data from each time point, a ratio is made
between each pair of images, and each pixel value is then used to calculate the
fractional hybridization of the labeled nucleotide. At small fractional values of
hybridization, little expression of β-actin is indicated. At high fractional values of
hybridization, maximal expression of β-actin is indicated. Furthermore, the distribution
of hybridized molecules within the cytoplasm of the indicator cells is also a measure of
the physiological response of the indicator cells.
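
Because energy transfer is maximal in the tethered state and lost on hybridization, the acceptor/donor ratio falls as hybridization rises. The sketch below illustrates that inverse relationship under an assumed linear two-state model. The reference ratios `ratio_closed` and `ratio_open`, the function names, and the pixel values are hypothetical and would have to come from calibration.

```python
def fraction_hybridized(acceptor, donor, ratio_closed, ratio_open):
    """Estimate fractional hybridization from one acceptor/donor pixel pair.

    ratio_closed: assumed acceptor/donor ratio of the tethered (unhybridized) probe.
    ratio_open: assumed ratio of the fully hybridized probe (energy transfer lost).
    """
    if donor <= 0:
        raise ValueError("donor intensity must be positive")
    ratio = acceptor / donor
    fraction = (ratio_closed - ratio) / (ratio_closed - ratio_open)
    return min(1.0, max(0.0, fraction))


def mean_fraction(acceptor_pixels, donor_pixels, ratio_closed, ratio_open):
    """Average the per-pixel estimate over the pixels of one cell."""
    values = [
        fraction_hybridized(a, d, ratio_closed, ratio_open)
        for a, d in zip(acceptor_pixels, donor_pixels)
    ]
    return sum(values) / len(values)


# Hypothetical pixels from one cell: high ratio = tethered, low ratio = hybridized.
cell_fraction = mean_fraction([300.0, 200.0, 100.0], [100.0, 100.0, 100.0],
                              ratio_closed=3.0, ratio_open=1.0)
```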

Cell surface binding of a ligand 

Labeled insulin binding to its cell surface receptor in living cells. Cells
whose plasma membrane domain has been labeled with a labeling reagent of a
particular color are incubated with a solution containing insulin molecules (Lee et al.
(1997), Biochemistry 36:2701-2708; Martinez-Zaguilan et al. (1996), Am. J. Physiol.
270:C1438-C1446) that are labeled with a luminescent probe of a different color for an
appropriate time under the appropriate conditions. After incubation, unbound insulin
molecules are washed away, the cells are fixed, and the distribution and concentration of
the insulin on the plasma membrane are measured. To do this, the cell membrane image is
used as a mask for the insulin image. The integrated intensity from the masked insulin
image is compared to a set of images containing known amounts of labeled insulin.
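
The masking and comparison steps can be illustrated as follows. This is a sketch under stated assumptions, not the disclosed implementation: the mask is taken to be a simple threshold on the membrane channel, and the comparison to known amounts is modeled as linear interpolation between standards. The function names, the threshold, and all numbers are hypothetical.

```python
def masked_integrated_intensity(insulin_image, membrane_image, threshold):
    """Sum insulin-channel pixels wherever the membrane channel exceeds threshold."""
    total = 0.0
    for insulin_row, membrane_row in zip(insulin_image, membrane_image):
        for insulin_px, membrane_px in zip(insulin_row, membrane_row):
            if membrane_px > threshold:  # pixel belongs to the membrane mask
                total += insulin_px
    return total


def amount_from_standards(intensity, standards):
    """Linearly interpolate an amount from (intensity, amount) standards."""
    points = sorted(standards)
    for (i0, a0), (i1, a1) in zip(points, points[1:]):
        if i0 <= intensity <= i1:
            return a0 + (a1 - a0) * (intensity - i0) / (i1 - i0)
    raise ValueError("intensity lies outside the calibrated range")


# Hypothetical 3 x 3 membrane and insulin channels of one field.
membrane = [[0, 9, 0], [9, 9, 9], [0, 9, 0]]
insulin = [[5, 20, 5], [30, 10, 40], [5, 10, 5]]
signal = masked_integrated_intensity(insulin, membrane, threshold=4)
bound = amount_from_standards(signal, [(0.0, 0.0), (100.0, 1.0), (200.0, 2.0)])
```

Insulin signal outside the membrane mask (the corner pixels here) is excluded from the integrated intensity before it is compared with the standards.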

In a second embodiment, subdomains of the plasma membrane (the extracellular
surface, the lipid bilayer, and the intracellular surface) can be labeled separately and
used as components of high content screens. In the first embodiment, the extracellular
surface is labeled using a brief treatment with a reactive fluorescent molecule such as
the succinimidyl ester or iodoacetamide derivatives of fluorescent dyes such as the
fluoresceins, rhodamines, cyanines, and Bodipys.

In a third embodiment, the extracellular surface is labeled using fluorescently 
labeled macromolecules with a high affinity for cell surface molecules. These include 
fluorescently labeled lectins such as the fluorescein, rhodamine, and cyanine 
derivatives of lectins derived from jack bean (Con A), red kidney bean 
(erythroagglutinin PHA-E), or wheat germ. 

In a fourth embodiment, fluorescently labeled antibodies with a high affinity for 
cell surface components are used to label the extracellular region of the plasma 
membrane. Extracellular regions of cell surface receptors and ion channels are 
examples of proteins that can be labeled with antibodies. 

In a fifth embodiment, the lipid bilayer of the plasma membrane is labeled with
fluorescent molecules. These molecules include fluorescent dyes attached to long chain
hydrophobic molecules that interact strongly with the hydrophobic region in the center
of the plasma membrane lipid bilayer. Examples of these dyes include the PKH series
of dyes (U.S. 4,783,401, 4,762,701, and 4,859,584; available commercially from Sigma
Chemical Company, St. Louis, MO), fluorescent phospholipids such as
nitrobenzoxadiazole glycerophosphoethanolamine and fluorescein-derivatized
dihexadecanoylglycerophosphoethanolamine, fluorescent fatty acids such as
5-butyl-4,4-difluoro-4-bora-3a,4a-diaza-s-indacene-3-nonanoic acid and 1-pyrenedecanoic
acid (Molecular Probes, Inc.), fluorescent sterols including cholesteryl
4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s-indacene-3-dodecanoate and cholesteryl
1-pyrenehexanoate, and fluorescently labeled proteins that interact specifically with lipid
bilayer components such as the fluorescein derivative of annexin V (Caltag Antibody
Co, Burlingame, CA).

In another embodiment, the intracellular component of the plasma membrane is 
labeled with fluorescent molecules. Examples of these molecules are the intracellular 
components of the trimeric G-protein receptor, adenylyl cyclase, and ionic transport

membrane protein proteases, and nucleases as well as the ATP-driven lysosomal proton
pump.

In a third embodiment, protein chimeras consisting of a lysosomal protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the lysosomal domain. Examples of these 
components are the degradative enzymes involved in cholesterol ester hydrolysis, 
membrane protein proteases, and nucleases as well as the ATP-driven lysosomal proton 
pump. 

Cytoplasmic fluorescence labeling 

In one embodiment, cell permeant fluorescent dyes (Molecular Probes, Inc.)
with a reactive group are reacted with living cells. Reactive dyes including
monobromobimane, 5-chloromethylfluorescein diacetate, carboxyfluorescein diacetate
succinimidyl ester, and chloromethyl tetramethylrhodamine are examples of cell
permeant fluorescent dyes that are used for long term labeling of the cytoplasm of cells.

In a second embodiment, polar tracer molecules such as Lucifer yellow and 
cascade blue-based fluorescent dyes (Molecular Probes, Inc.) are introduced into cells 
using bulk loading methods and are also used for cytoplasmic labeling. 

In a third embodiment, antibodies against cytoplasmic components (Sigma 
Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to fluorescently 
label the cytoplasm. Examples of cytoplasmic antigens are many of the enzymes 
involved in intermediary metabolism. Enolase, phosphofructokinase, and acetyl-CoA 
dehydrogenase are examples of uniformly distributed cytoplasmic antigens. 

In a fourth embodiment, protein chimeras consisting of a cytoplasmic protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the cytoplasm. Fluorescent chimeras of 
uniformly distributed proteins are used to label the entire cytoplasmic domain. 
Examples of these proteins are many of the proteins involved in intermediary 
metabolism and include enolase, lactate dehydrogenase, and hexokinase. 

In a fifth embodiment, antibodies against cytoplasmic antigens (Sigma
Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label
cytoplasmic components that are localized in specific cytoplasmic sub-domains

function. DNA, RNA, histones, DNA polymerase, RNA polymerase, lamins, and
nuclear variants of cytoplasmic proteins such as actin are examples of nuclear antigens.

In a third embodiment, protein chimeras consisting of a nuclear protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the nuclear domain. Examples of these 
proteins are many of the proteins involved in maintaining DNA structure and function. 
Histones, DNA polymerase, RNA polymerase, lamins, and nuclear variants of 
cytoplasmic proteins such as actin are examples of nuclear proteins. 

Mitochondrial labeling 

In one embodiment, membrane permeant mitochondrial-specific luminescent 
reagents (Molecular Probes, Inc.) are used to label the mitochondria of living and fixed 
cells. These reagents include rhodamine 123, tetramethyl rosamine, JC-1, and the 
MitoTracker reactive dyes. 

In a second embodiment, antibodies against mitochondrial antigens (Sigma 
Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 
mitochondrial components that are localized in specific mitochondrial domains. 
Examples of these components are the macromolecules involved in maintaining 
mitochondrial DNA structure and function. DNA, RNA, histones, DNA polymerase, 
RNA polymerase, and mitochondrial variants of cytoplasmic macromolecules such as 
mitochondrial tRNA and rRNA are examples of mitochondrial antigens. Other examples
of mitochondrial antigens are the components of the oxidative phosphorylation system 
found in the mitochondria (e.g., cytochrome c, cytochrome c oxidase, and succinate 
dehydrogenase). 

In a third embodiment, protein chimeras consisting of a mitochondrial protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the mitochondrial domain. Examples of 
these components are the macromolecules involved in maintaining mitochondrial DNA 
structure and function. Examples include histones, DNA polymerase, RNA 
polymerase, and the components of the oxidative phosphorylation system found in the 
mitochondria (e.g., cytochrome c, cytochrome c oxidase, and succinate 
dehydrogenase).

While many of the examples presented involve the measurement of single
cellular processes, this again is intended for purposes of illustration only. Multiple
parameter high-content screens can be produced by combining several single parameter 
screens into a multiparameter high-content screen or by adding cellular parameters to 
any existing high-content screen. Furthermore, while each example is described as 
being based on either live or fixed cells, each high-content screen can be designed to be 
used with both live and fixed cells. 

Those skilled in the art will recognize a wide variety of distinct screens that can 
be developed based on the disclosure provided herein. There is a large and growing list 
of known biochemical and molecular processes in cells that involve translocations or 
reorganizations of specific components within cells. The signaling pathway from the 
cell surface to target sites within the cell involves the translocation of plasma 
membrane-associated proteins to the cytoplasm. For example, it is known that one of
the src family of protein tyrosine kinases, pp60c-src (Walker et al. (1993), J. Biol.
Chem. 268:19552-19558) translocates from the plasma membrane to the cytoplasm
upon stimulation of fibroblasts with platelet-derived growth factor (PDGF).
Additionally, the targets for screening can themselves be converted into
fluorescence-based reagents that report molecular changes including ligand-binding and
post-translational modifications.

Example 10. Protease Biosensors 
(1) Background 

As used herein, the following terms are defined as follows: 

• Reactant - the parent biosensor that interacts with the proteolytic enzyme. 

• Product - the signal-containing proteolytic fragment(s) generated by the interaction
of the reactant with the enzyme.

• Reactant Target Sequence - an amino acid sequence that imparts a restriction on the 
cellular distribution of the reactant to a particular subcellular domain of the cell. 

• Product Target Sequence - an amino acid sequence that imparts a restriction on the
cellular distribution of the signal-containing product(s) of the targeted enzymatic
reaction to a particular subcellular domain of the cell. If the product is initially
localized within a membrane bound compartment, then the Product Target

(1994)), enzyme-based incorporation of luminescent substrates into proteins
(Buckler, et al., Analyt. Biochem. 209:20-31 (1993); Takashi, Biochemistry.
27:938-943 (1988)), and the incorporation of unnatural labeled amino acids into
proteins (Noren, et al., Science. 244:182-188 (1989)).

• Detection - a means for recording the presence, position, or amount of the signal.
The approach may be direct, if the signal is inherently fluorescent, or indirect, if, for
example, the signal is an epitope that must be subsequently detected with a labeled
antibody. Modes of detection include, but are not limited to, the spatial position of
fluorescence, luminescence, or phosphorescence: (1) intensity; (2) polarization; (3)
lifetime; (4) wavelength; (5) energy transfer; and (6) recovery after photobleaching.
The basic principle of the protease biosensors of the present invention is to 
spatially separate the reactants from the products generated during a proteolytic 
reaction. The separation of products from reactants occurs upon proteolytic cleavage of 
the protease recognition site within the biosensor, allowing the products to bind to, 
diffuse into, or be imported into compartments of the cell different from those of the 
reactant. This spatial separation provides a means of quantitating a proteolytic process 
directly in living or fixed cells. Some designs of the biosensor provide a means of 
restricting the reactant (uncleaved biosensor) to a particular compartment by a protein 
sequence ("reactant target sequence") that binds to or imports the biosensor into a 
compartment of the cell. These compartments include, but are not limited to, any
cellular substructure, macromolecular cellular component, membrane-limited 
organelles, or the extracellular space. Given that the characteristics of the proteolytic 
reaction are related to product concentration divided by the reactant concentration, the 
spatial separation of products and reactants provides a means of uniquely quantitating 
products and reactants in single cells, allowing a more direct measure of proteolytic 
activity. 
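
As an illustration of why spatial separation helps, the sketch below computes a product-to-reactant ratio for a single cell from one image and a compartment label map. It assumes, for illustration only, a biosensor whose reactant is confined to the cytoplasm (label `C`) and whose product accumulates in the nucleus (label `N`). The label scheme, function names, and pixel values are hypothetical.

```python
def mean_intensity(image, labels, compartment):
    """Mean intensity of the pixels whose label equals `compartment`."""
    values = [
        px
        for image_row, label_row in zip(image, labels)
        for px, label in zip(image_row, label_row)
        if label == compartment
    ]
    if not values:
        raise ValueError("no pixels labeled " + compartment)
    return sum(values) / len(values)


def product_to_reactant_ratio(image, labels, product_label="N", reactant_label="C"):
    """Ratio of mean signal in the product compartment to the reactant compartment."""
    return mean_intensity(image, labels, product_label) / mean_intensity(
        image, labels, reactant_label
    )


# Hypothetical single-cell label map and images before and after protease activation.
labels = ["CCC", "CNC", "CNC"]
before = [[40, 40, 40], [40, 10, 40], [40, 10, 40]]
after = [[20, 20, 20], [20, 60, 20], [20, 60, 20]]
ratio_before = product_to_reactant_ratio(before, labels)
ratio_after = product_to_reactant_ratio(after, labels)
```

Because the ratio is formed within each cell, it does not depend on how much biosensor that cell happens to contain.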

The molecular-based biosensors may be introduced into cells via transfection 
and the expressed chimeric proteins analyzed in transient cell populations or stable cell 
lines. They may also be pre-formed, for example by production in a prokaryotic or 
eukaryotic expression system, and the purified protein introduced into the cell via a 
number of physical mechanisms including, but not limited to, micro-injection, scrape 
loading, electroporation, signal-sequence mediated loading, etc.

advantage of the natural subcellular localization of these and other target proteins to
achieve reactant targeting. Upon cleavage, the signal (with or without a product target 
sequence) is separated from the reactant to create a high-content biosensor. 

One of skill in the art will recognize that the protein biosensors of this aspect of 
the invention can be adapted to report the activity of any member of the caspase family 
of proteases, as well as any other protease, by a substitution of the appropriate protease 
recognition site in any of the constructs (see Figure 29B). These biosensors can be 
used in high-content screens to detect in vivo activation of enzymatic activity and to 
identify specific activity based on cleavage of a known recognition motif. This screen 
can be used for both live cell and fixed end-point assays, and can be combined with 
additional measurements to provide a multi-parameter assay. 

Thus, in another aspect the present invention provides recombinant nucleic acids 
encoding a protease biosensor, comprising: 

a. a first nucleic acid sequence that encodes at least one detectable 
polypeptide signal; 

b. a second nucleic acid sequence that encodes at least one protease 
recognition site, wherein the second nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal; 
and 

c. a third nucleic acid sequence that encodes at least one reactant target 
sequence, wherein the third nucleic acid sequence is operatively linked to the second 
nucleic acid sequence that encodes the at least one protease recognition site. 

In this aspect, the first and third nucleic acid sequences are separated by the 
second nucleic acid sequence, which encodes the protease recognition site. 
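
The domain arrangement recited above can be modeled as an ordered list in which the protease recognition site separates the signal from the reactant target sequence. The sketch below is only a conceptual illustration. The `Domain` class, the function names, and the example domain names are hypothetical, and removal of the recognition site on cleavage is a simplification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    kind: str  # "signal", "protease_site", "reactant_target", or "product_target"
    name: str


def cleave(domains):
    """Split a biosensor at its protease recognition site.

    Returns (left, right): the domains on each side of the site. The site
    itself is consumed, which is a simplification of real proteolysis.
    """
    kinds = [d.kind for d in domains]
    site = kinds.index("protease_site")  # raises ValueError if absent
    return domains[:site], domains[site + 1:]


def is_valid_three_domain(domains):
    """Check that the site lies between the signal and the reactant target."""
    left, right = cleave(domains)
    left_kinds = {d.kind for d in left}
    right_kinds = {d.kind for d in right}
    return ("signal" in left_kinds and "reactant_target" in right_kinds) or (
        "signal" in right_kinds and "reactant_target" in left_kinds
    )


# Illustrative three-domain arrangement.
biosensor = [
    Domain("signal", "GFP"),
    Domain("protease_site", "caspase-3 site"),
    Domain("reactant_target", "annexin II"),
]
product, remainder = cleave(biosensor)
```

After cleavage the signal no longer carries the reactant target sequence, which is what frees it to redistribute within the cell.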

In a further embodiment, the recombinant nucleic acid encoding a protease 
biosensor comprises a fourth nucleic acid sequence that encodes at least one product 
target sequence, wherein the fourth nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal. 

In a further embodiment, the recombinant nucleic acid encoding a protease
biosensor comprises a fifth nucleic acid sequence that encodes at least one detectable

Inherent in this embodiment is the concept that the reactant target sequence
restricts the cellular distribution of the reactant, with redistribution of the product
occurring after activation (i.e., protease cleavage). This redistribution does not require a
complete sequestration of products and reactants, as the product distribution can
partially overlap the reactant distribution in the absence of a product targeting signal
(see below).

In a preferred embodiment, the recombinant protease biosensor further
comprises a fourth domain comprising at least one product target sequence, wherein the
fourth domain and the first domain are operatively linked and are separated from the
third domain by the second domain. In another embodiment, the recombinant protease
biosensor further comprises a fifth domain comprising at least one detectable
polypeptide signal, wherein the fifth domain and the third domain are operatively
linked and are separated from the first domain by the second domain.

In a preferred embodiment, the detectable polypeptide signal domain (first or
fifth domain) is selected from the group consisting of fluorescent proteins, luminescent
proteins, and sequence epitopes. In a most preferred embodiment, the detectable
polypeptide signal domain comprises a sequence selected from the group consisting of
SEQ ID NOS:36, 38, 40, 42, 44, 46, 48, 50, and 52.

In another preferred embodiment, the second domain comprising a protease
recognition site comprises a sequence selected from the group consisting of SEQ ID
NOS:54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96,
98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, and 122. In another
preferred embodiment, the reactant and/or target sequence domains comprise a
sequence selected from the group consisting of SEQ ID NOS:124, 126, 128, 130, 132,
134, 136, 138, 140, 142, 144, 146, 148, 150, and 152.

In a most preferred embodiment, the recombinant protease biosensor comprises 
a sequence substantially similar to sequences selected from the group consisting of 
SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, and 34. 

In a still further embodiment, the present invention provides methods and kits
for automated analysis of cells, comprising using cells that possess the protease
biosensors of the invention to identify compounds that affect protease activity. The

or in vivo when placed under the control of appropriate regulatory sequences. The
boundaries of the coding sequence are determined by a start codon at the 5' (amino)
terminus and a translation stop codon at the 3' (carboxy) terminus. A coding sequence
can include, but is not limited to, cDNA from prokaryotic or eukaryotic mRNA,
genomic DNA sequences from prokaryotic or eukaryotic DNA, and synthetic DNA 
sequences. A transcription termination sequence will usually be located 3' to the coding 
sequence. 

As used herein, the term DNA "control sequences" refers collectively to 
promoter sequences, ribosome binding sites, polyadenylation signals, transcription 
termination sequences, upstream regulatory domains, enhancers, and the like, which 
collectively provide for the transcription and translation of a coding sequence in a host 
cell. Not all of these control sequences need always be present in a recombinant vector 
so long as the DNA sequence of interest is capable of being transcribed and translated 
appropriately. 

As used herein, the term "operatively linked" refers to an arrangement of 
elements wherein the components so described are configured so as to perform their 
usual function. Thus, control sequences operatively linked to a coding sequence are 
capable of effecting the expression of the coding sequence. The control sequences need 
not be contiguous with the coding sequence, so long as they function to direct the 
expression thereof. Thus, for example, intervening untranslated yet transcribed 
sequences can be present between a promoter sequence and the coding sequence and 
the promoter sequence can still be considered "operatively linked" to the coding 
sequence. 

Furthermore, a nucleic acid coding sequence is operatively linked to another
nucleic acid coding sequence when the coding regions for both nucleic acid molecules
are capable of expression in the same reading frame. The nucleic acid sequences need
not be contiguous, so long as they are capable of expression in the same reading frame. 
Thus, for example, intervening coding regions can be present between the specified 
nucleic acid coding sequences, and the specified nucleic acid coding regions can still be 
considered "operatively linked". 
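
The reading-frame condition can be checked mechanically. The sketch below tests whether an intervening coding segment keeps a downstream coding sequence in the same frame, on the assumption that the upstream sequence ends on a codon boundary. The function name and example linkers are hypothetical, and the check ignores everything about expression other than frame and stop codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def stays_in_frame(intervening):
    """True if an intervening coding segment keeps downstream codons in frame.

    The segment must be a whole number of codons and contain no in-frame stop
    codon; both conditions are simplifications for illustration.
    """
    seq = intervening.upper()
    if len(seq) % 3 != 0:
        return False  # would shift the reading frame of the downstream sequence
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return not any(codon in STOP_CODONS for codon in codons)


linker_ok = stays_in_frame("GGATCCGGA")      # three codons, no stop
linker_shift = stays_in_frame("GGATCCGG")    # eight bases: frameshift
linker_stop = stays_in_frame("GGATAAGGA")    # contains an in-frame TAA
```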

The intervening coding sequences between the various domains of the 
biosensors can be of any length so long as the function of each domain is retained.

claimed herein. For example, functionally equivalent DNAs encode protease
biosensors that are the same as those disclosed herein or that have one or more 
conservative amino acid variations, such as substitutions of non-polar residues for other 
non-polar residues or charged residues for similarly charged residues, or addition 
to/deletion from regions of the protease biosensor not critical for functionality. These 
changes include those recognized by those of skill in the art as substitutions, deletions, 
and/or additions that do not substantially alter the tertiary structure of the protein. 

As used herein, substantially similar sequences of nucleotides or amino acids 
share at least about 70%-75% identity, more preferably 80-85% identity, and most 
preferably 90-95% identity. It is recognized, however, that proteins (and DNA or 
mRNA encoding such proteins) containing less than the above-described level of 
homology (due to the degeneracy of the genetic code) or that are modified by 
conservative amino acid substitutions (or substitution of degenerate codons) are 
contemplated to be within the scope of the present invention. 
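
For illustration, percent identity between two sequences that have already been aligned can be computed as below. The alignment step itself is not shown. The tier cut-offs use the lower bound of each range quoted above, which is an assumption of this sketch, and the example peptides are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent identity between two pre-aligned, equal-length sequences.

    Gap characters ('-') are allowed; columns where both are gaps are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(a, b) for a, b in zip(seq_a.upper(), seq_b.upper())
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned positions to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def similarity_tier(identity):
    """Map an identity value onto the tiers named in the text."""
    if identity >= 90.0:
        return "most preferred"
    if identity >= 80.0:
        return "more preferred"
    if identity >= 70.0:
        return "substantially similar"
    return "below threshold"


# Hypothetical aligned peptides differing at one of ten positions.
identity = percent_identity("MKVLAAGIVG", "MKVLSAGIVG")
tier = similarity_tier(identity)
```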

The term "heterologous" as it relates to nucleic acid sequences such as coding 
sequences and control sequences, denotes sequences that are not normally associated 
with a region of a recombinant construct, and/or are not normally associated with a 
particular cell. Thus, a "heterologous" region of a nucleic acid construct is an 
identifiable segment of nucleic acid within or attached to another nucleic acid molecule 
that is not found in association with the other molecule in nature. For example, a 
heterologous region of a construct could include a coding sequence flanked by 
sequences not found in association with the coding sequence in nature. Another 
example of a heterologous coding sequence is a construct where the coding sequence 
itself is not found in nature (e.g., synthetic sequences having codons different from the 
native gene). Similarly, a host cell transformed with a construct which is not normally 
present in the host cell would be considered heterologous for purposes of this invention. 

Within this application, unless otherwise stated, the techniques utilized may be 
found in any of several well-known references such as: Molecular Cloning: A 
Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press), 
Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. 
Goeddel, 1991, Academic Press, San Diego, CA), "Guide to Protein Purification" in
Methods in Enzymology (M.P. Deutscher, ed., (1990) Academic Press, Inc.); PCR

with the ligation mixtures using standard techniques. Transformed cells were selected
on LB-agar with an appropriate antibiotic.

Cells and transfections. For DNA transfection, BHK cells and MCF-7 cells
were cultured to 50-70% confluence in 6 well plates containing 3 ml of minimal
Eagle's medium (MEM) with 10% fetal calf serum, 1 mM L-glutamine, 50 µg/ml
streptomycin, 50 µg/ml penicillin, 0.1 mM non-essential amino acids, 1 mM sodium
pyruvate and 10 ng/ml of bovine insulin (for MCF-7 cells only) at 37°C in a 5% CO2
incubator for about 36 hours. The cells were washed with serum free MEM media and
incubated for 5 hours with 1 ml of transfection mixture containing 1 µg of the
appropriate plasmid and 4 µg of Lipofectamine (BRL) in the serum free MEM media.
Subsequently, the transfection medium was removed and replaced with 3 ml of normal
culture media. The transfected cells were maintained in growth medium for at least 16
hours before performing selection of the stable cells based on standard molecular
biology methods (Ausubel et al., 1995).

Apoptosis assay. For apoptosis assays, the cells (BHK, MCF-7) stably
transfected with the appropriate protease biosensor expression vector were plated on
tissue culture treated 96-well plates at 50-60% confluence and cultured overnight at
37°C, 5% CO2. Varying concentrations of cis-platin, staurosporine, or paclitaxel in
normal culture media were freshly prepared from stock and added to cell culture dishes
to replace the old culture media. The cells were then observed with the cell screening
system of the present invention at the indicated time points either as live cell
experiments or as fixed end-point experiments.

1. Construction of 3-domain protease biosensors 

a. Caspase-3 biosensor with an annexin II reactant targeting domain 
(pljkGFP). 

The design of this biosensor is outlined in Figure 31, and its sequence is shown
in SEQ ID NO:1 and 2.

This biosensor provides a measure of the proteolytic activity around the annexin
II cytoskeleton binding sites within the cell. Given the dispersed nature of the
cytoskeleton and the effectively diffuse state of cytosolic enzymes, this provides an
effective measure of the cytoplasm in general.

Results & Discussion: 

Fig 32 illustrates images before and after stimulation of apoptosis by cis-platin
in BHK cells transfected with the caspase-3 biosensor. The images clearly illustrate
accumulation of fluorescence in the nucleus. Generation of the spatial change in
fluorescence is non-reversible and thus the timing of the assay is flexible. Controls for
this biosensor include using a version in which the caspase-3-specific site has been
omitted. In addition, disruption of the cytoskeleton with subsequent cell rounding did
not produce the change in fluorescence distribution. Our experiments demonstrate the
correlation of nuclear condensation with activation of caspase activity. We have also
tested this biosensor in MCF-7 cells. A recent report measured a peak response in
caspase-3 activity 6 h after stimulation of MCF-7 cells with etoposide accompanied by
cleavage of PARP (Benjamin et al. 1998. Mol Pharmacol. 53:446-50). However,
another recent report found that MCF-7 cells do not possess caspase-3 activity and, in
fact, the caspase-3 gene is functionally deleted (Janicke et al. 1998. J Biol Chem.
273:9357-60). Caspase-3 activity was not detected with the caspase biosensor in
MCF-7 cells after a 15 h treatment with 100 µM etoposide.

Janicke et al. (1998) also indicated that many of the conventional substrates of
caspase-3 were cleaved in MCF-7 cells upon treatment with staurosporine. Our
experiments demonstrate that caspase activity can be measured using the biosensor in
MCF-7 cells when treated with staurosporine. The maximum magnitude of the
activation by staurosporine was approximately one-half that demonstrated with
cis-platin in BHK cells. This also implies that the current biosensor, although designed to
be caspase-3-specific, is indeed specific for a class of caspases rather than uniquely
specific for caspase-3. The most likely candidate is caspase-7 (Janicke et al., 1998).
These experiments also demonstrated that the biosensor can be used in multiparameter
experiments, with the correlation of decreases in mitochondrial membrane potential,
nuclear condensation, and caspase activation.

c. Caspase biosensor with a nuclear export signal

Another approach for restricting the reactant to the cytoplasm is to actively 
restrict the reactant from the nucleus by using a nuclear export signal. Cleavage of 
such a biosensor liberates a product capable of diffusing into the nucleus. 

The Bacillus anthracis bacterium expresses a zinc metalloprotease protein
complex called anthrax protease. Human mitogen activated protein kinase kinase 1
(MEK 1) (Seger et al., J. Biol. Chem. 267:25628-25631, 1992) possesses an anthrax
protease recognition site (amino acids 1-13) (SEQ ID NO:102) (Figure 29B) that is
cleaved after amino acid 8, as well as a nuclear export signal at amino acids 32-44
(SEQ ID NO:140) (Figure 29C). Human MEK 2 (Zheng and Guan, J. Biol. Chem.
268:11435-11439, 1993) possesses an anthrax protease recognition site comprising
amino acid residues 1-16 (SEQ ID NO:104) (Figure 29B) and a nuclear export signal
at amino acids 36-48 (SEQ ID NO:148) (Figure 29C).

The anthrax protease biosensor comprises Fret25 (SEQ ID NO:48) (Figure
29A) as the signal, the anthrax protease recognition site, and the nuclear export signal
from MEK 1 or MEK 2 (SEQ ID NOS: 7-8 (MEK 1); 9-10 (MEK 2)). The intact
biosensor will be retained in the cytoplasm by virtue of this nuclear export signal (e.g.,
the reactant target site). Upon cleavage of the fusion protein by anthrax protease, the
NES will be separated from the GFP, allowing the GFP to diffuse into the nucleus.

2. Construction of 4- and 5-domain biosensors

For all of the examples presented above for 3-domain protease biosensors, a
product targeting sequence, including but not limited to those in Figure 29C, such as a
nuclear localization sequence (NLS), can be operatively linked to the signal sequence, 
and thus cause the signal sequence to segregate from the reactant target domain after 
proteolytic cleavage. Addition of a second detectable signal domain, including but not 
limited to those in Figure 29A, operatively linked with the reactant target domain is 
also useful in allowing measurement of the reaction by multiple means. Specific 
examples of such biosensors are presented below. 



a. 4-domain biosensors 

1. Caspase biosensors with nuclear localization sequences 
sequences with different relative strengths for targeting. Using the example of the 
nuclear localization sequence (NLS) and annexin II sequences, different strengths of 
NLS have been tried with clone selection based on cytoplasmic restriction of the parent 
biosensor. Upon activation, the product targeting sequence will naturally dominate the 
localization of its associated detectable sequence domain because it is then separated 
from the reactant targeting sequence. 

An added benefit of using this biosensor is that the product is targeted, and thus 
concentrated, into a smaller region of the cell. Thus, smaller amounts of product are 
detectable due to the increased concentration of the product. This concentration effect 
is relatively insensitive to the cellular concentration of the reactant. The signal-to-noise 
ratio (SNR) of such a measurement is improved over the more dispersed distribution of 
biosensor #1. 

Similar biosensors that incorporate either the caspase 6 (SEQ ID NO:66) 
(Figure 29B) or the caspase 8 protease recognition sequence (SEQ ID NO:74) (Figure 
29B) can be made using the methods described above, but using the following primer 
sets: 

Primers for Caspase 6, Product target sequence = NLS (CP6GFPNLS-CYTO) 

1) TCA TCA TCC GGA AGA AGG AAA CGA CAA AAG CGA TCG 
ACA AGA CTT GTT GAA ATT GAG AAC (SEQ ID NO:159) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC 
ACC CTC CAA GCT GAG CTT GCA CAG GAT TTC GTG GAC 
AGT AGA CAT AGT ACT GTT GTC AAT TTC (SEQ ID NO:160) 

3) TCA TCA TCC GGA AGA AGG (SEQ ID NO:158) 
4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

Primers for Caspase 8, Product target sequence = NLS (CP8GFPNLS-CYTO) 

1) TCA TCA TCC GGA AGA AGG AAA CGA CAA AAG CGA TCG 
TAT CAA AAA GGA ATA CCA GTT GAA ACA GAC AGC GAA GAG 
CAA CCT TAT (SEQ ID NO:161) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC CTC 
fragment, which is still intact following proteolysis by caspase-3, continues to report on 
the integrity of the microtubule cytoskeleton during the process of apoptosis via the 
second GFP molecule fused to the C-terminus of the biosensor. Therefore, this single 
chimeric protein allows simultaneous analysis of caspase-3 activity and the 
polymerization state of the microtubule cytoskeleton during apoptosis induced by a 
variety of agents. This biosensor is also useful for analysis of potential drug candidates 
that specifically target the microtubule cytoskeleton, since one can determine whether a 
particular drug induced apoptosis in addition to affecting microtubules. 

This biosensor potentially combines a unique signal for the reactant, 
fluorescence resonance energy transfer (FRET) from signal 2 to signal 1, and a unique 
signal localization for the product, nuclear accumulation of signal 1. The amount of 
product generated will also be indicated by the magnitude of the loss in FRET, but this 
will be a smaller SNR than the combination of FRET detection of reactant and spatial 
localization of the product. 

FRET can occur when the emission spectrum of a donor overlaps significantly 
the absorption spectrum of an acceptor molecule (dos Remedios, C.G., and P.D. 
Moens. 1995. Fluorescence resonance energy transfer spectroscopy is a reliable "ruler" 
for measuring structural changes in proteins. Dispelling the problem of the unknown 
orientation factor. J Struct Biol 115:175-85; Emmanouilidou, E., A.G. Teschemacher, 
A.E. Pouli, L.I. Nicholls, E.P. Seward, and G.A. Rutter. 1999. Imaging Ca(2+) 
concentration changes at the secretory vesicle surface with a recombinant targeted 
cameleon. Curr Biol 9:915-918). The average physical distance between the donor and 
acceptor molecules should be between 1 nm and 10 nm with a preference of between 1 
nm and 6 nm. The intervening sequence length can vary considerably since the three 
dimensional structure of the peptide will determine the physical distance between donor 
and acceptor. This FRET signal can be measured as (1) the amount of quenching of the 
donor in the presence of the acceptor, (2) the amount of acceptor emission when 
exciting the donor, and/or (3) the ratio between the donor and acceptor emission. 
Alternatively, fluorescence lifetimes of donor and acceptor could be measured. 
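As a rough illustration of why the donor-acceptor distance range matters, the following Python sketch evaluates the standard Förster relation, E = 1/(1 + (r/R0)^6), together with two of the intensity-based readouts listed above. The Förster radius of 5 nm and the function names are assumptions chosen for illustration only; they are not values or procedures taken from this disclosure.

```python
def fret_efficiency(r_nm, r0_nm=5.0):
    """Forster transfer efficiency at donor-acceptor distance r_nm.

    r0_nm is the Forster radius (distance of 50% transfer); 5 nm is an
    assumed placeholder, not a measured value for any pair in this text.
    """
    return 1.0 / (1.0 + (r_nm / r0_nm) ** 6)


def donor_quenching(donor_with_acceptor, donor_alone):
    """Readout (1): fractional loss of donor emission when the acceptor is present."""
    return 1.0 - donor_with_acceptor / donor_alone


def emission_ratio(acceptor_emission, donor_emission):
    """Readout (3): acceptor emission divided by donor emission, both under donor excitation."""
    return acceptor_emission / donor_emission


if __name__ == "__main__":
    # Efficiency falls steeply across the 1-10 nm range discussed above.
    for r in (1.0, 3.0, 5.0, 6.0, 10.0):
        print(f"r = {r:4.1f} nm  E = {fret_efficiency(r):.3f}")
```

Because the efficiency depends on the sixth power of distance, a cleavage event that separates donor and acceptor beyond roughly twice the assumed Förster radius removes nearly all transfer, which is consistent with using loss of FRET as a reactant signal.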

This case adds value to the above FRET biosensor by nature of the existence of 
the reactant targeting sequence. This sequence allows the placement of the biosensor 
3. Caspase 8 biosensor with a nucleolar localization domain (CP8GFPNUC-CYTO) 

This approach (diagrammed in Figure 34) utilizes a biosensor for the detection 
of caspase-8 activity. In this biosensor, a nucleolar localization signal 
(RKRIRTYLKSCRRMKRSGFEMSRPIPSHLT) (SEQ ID NO:130) (Figure 29C) 
(Ueki et al., Biochem. Biophys. Res. Comm. 252:97-100, 1998) was used as the 
product target sequence, and made by PCR using the primers described below. The 
PCR product was digested with BspEI and PvuI and gel purified. The vector and the 
PCR product were ligated as described above. 

Primers for Caspase 8, Nucleolar localization signal (CP8GFPNUC-CYTO): 

1) TCA TCA TCCGGA AGA AAA CGT ATA CGT ACT TAC CTC AAG 
TCC TGC AGG CGG ATG AAA AGA (SEQ ID NO:163) 

2) GAA GAA CGATCG AGT AAG GTG GGA AGG AAT AGG TCG AGA 
CAT CTC AAA ACC ACT TCT TTT CAT (SEQ ID NO:164) 

3) TCA TCA TCC GGA AGA AAA (SEQ ID NO:165) 

4) GAA GAA CGA TCG AGT AAG (SEQ ID NO:166) 
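The relationship between primers 1 and 2 and the nucleolar localization signal can be checked with a short Python sketch. It assumes that primers 1 and 2 anneal through their complementary 3' ends, and it uses the standard recognition sequences of BspEI (TCCGGA) and PvuI (CGATCG) and the standard genetic code. The helper names are hypothetical and this is not presented as the procedure actually used.

```python
# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Primers 1 and 2 as printed above, with spaces removed.
PRIMER_1 = ("TCA TCA TCCGGA AGA AAA CGT ATA CGT ACT TAC CTC AAG "
            "TCC TGC AGG CGG ATG AAA AGA").replace(" ", "")
PRIMER_2 = ("GAA GAA CGATCG AGT AAG GTG GGA AGG AAT AGG TCG AGA "
            "CAT CTC AAA ACC ACT TCT TTT CAT").replace(" ", "")

BSPEI_SITE = "TCCGGA"  # BspEI recognition sequence
PVUI_SITE = "CGATCG"   # PvuI recognition sequence


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def join_by_overlap(left, right):
    """Merge two strands at the longest suffix of left that is a prefix of right."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    raise ValueError("sequences do not overlap")


def translate(dna):
    """Translate DNA in frame 0, ignoring any trailing partial codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def encoded_signal():
    """Peptide encoded between the BspEI and PvuI sites of the assembled product."""
    product = join_by_overlap(PRIMER_1, reverse_complement(PRIMER_2))
    start = product.index(BSPEI_SITE) + len(BSPEI_SITE)
    end = product.index(PVUI_SITE, start)
    return translate(product[start:end])


if __name__ == "__main__":
    print(encoded_signal())
```

Under these assumptions the insert between the two restriction sites translates to the 30-residue peptide RKRIRTYLKSCRRMKRSGFEMSRPIPSHLT, and each primer carries the restriction site used for the digest described above.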

The sequence of the resulting biosensor is shown in SEQ ID NO:23-24. This 
biosensor includes the protease recognition site for caspase-8 (SEQ ID NO:74) 
(Figure 29B). A similar biosensor utilizes the protease recognition site for caspase-3 
(SEQ ID NO:25-26). 

These biosensors could be used with other biosensors that possess the same 
product signal color that are targeted to separate compartments, such as CP3GFPNLS- 
CYTO. The products of each biosensor reaction can be uniquely measured due to 
separation of the products based on the product targeting sequences. Both products 
from CP8GFPNUC-CYTO and CP3GFPNLS-CYTO are separable due to the different 
spatial positions, nucleus vs. nucleolus, even though the colors of the products are 
exactly the same. Assessing the non-nucleolar, nuclear region in order to avoid the 
spatial overlap of the two signals would perform the measurement of CP3GFPNLS in 
available per biosensor molecule. Aggregation of multiple fluorescent probes also can 
result in unique signals being manifested, such as FRET, self-quenching, excimer 
formation, etc. This could provide a unique signal to the reactants. 

5. Tetanus/botulinum biosensor with trans-membrane targeting domain 

In an alternative embodiment, a trans-membrane targeting sequence is used to 
tether the reactant to cytoplasmic vesicles, and an alternative protease recognition site 
is used. The tetanus/botulinum biosensor (SEQ ID NOS:27-28 (cellubrevin); 29-30 
(synaptobrevin)) consists of an NLS (SEQ ID NO:128) (Figure 29C), a Fret25 signal 
domain (SEQ ID NO:52) (Figure 29A), a tetanus or botulinum zinc metalloprotease 
recognition site from cellubrevin (SEQ ID NO:106) (Figure 29B) (McMahon et al., 
Nature 364:346-349, 1993; Martin et al., J. Cell Biol., in press) or synaptobrevin (SEQ 
ID NO:108) (Figure 29B) (GenBank Accession #U64520), and a trans-membrane 
sequence from cellubrevin (SEQ ID NO:146) (Figure 29C) or synaptobrevin (SEQ ID 
NO:144) (Figure 29C) at the 3'-end which tethers the biosensor to cellular vesicles. 
The N-terminus of each protein is oriented towards the cytoplasm. In the intact 
biosensor, GFP is tethered to the vesicles. Upon cleavage by the tetanus or botulinum 
zinc metalloprotease, GFP will no longer be associated with the vesicle and is free to 
diffuse throughout the cytoplasm and the nucleus. 

b. 5-domain biosensors 

1. Caspase 3 biosensor with a nuclear localization domain and a 
second signal domain operatively linked to an annexin II domain 

The design of this biosensor is outlined in Figure 35, and the sequence 
is shown in SEQ ID NO:33-34. This biosensor differs from SEQ ID NO:11-12 by 
including a second detectable signal, ECFP (SEQ ID NO:50) (Figure 29A) (signal 2) 
operatively linked to the reactant target sequence. 

2. Caspase 3 biosensor with a nuclear localization sequence and a 
second signal domain operatively linked to a MAP4 projection domain 
(CP3YFPNLS-CFPCYTO) 
(1) Detectors: general cell stress detection of a toxin; 

(2) Classifiers: perturbation of key molecular pathway(s) for detection and 
classification of a toxin; and 

(3) Identifiers: activity mediated detection and identification of a toxin or a 
group of toxins. 

Thus, in another aspect of the present invention, living cells are used as 
biosensors to interrogate the environment for the presence of toxic agents. In one 
embodiment of this aspect, an automated method for cell based toxin characterization is 
disclosed that comprises providing an array of locations containing cells to be treated 
with a test substance, wherein the cells possess at least a first luminescent reporter 
molecule comprising a detector and a second luminescent reporter molecule selected 
from the group consisting of a classifier or an identifier; contacting the cells with the 
test substance either before or after possession of the first and second luminescent 
reporter molecules by the cells; imaging or scanning multiple cells in each of the 
locations containing multiple cells to obtain luminescent signals from the detector; 
converting the luminescent signals from the detector into digital data to automatically 
measure changes in the localization, distribution, or activity of the detector on or in the 
cell, which indicates the presence of a toxin in the test substance; selectively imaging or 
scanning the locations containing cells that were contacted with test sample indicated to 
have a toxin in it to obtain luminescent signals from the second reporter molecule; 
converting the luminescent signals from the second luminescent reporter molecule into 
digital data to automatically measure changes in the localization, distribution, or 
activity of the classifier or identifier on or in the cell, wherein a change in the 
localization, distribution, structure or activity of the classifier identifies a cell pathway 
that is perturbed by the toxin present in the test substance, or wherein a change in the 
localization, distribution, structure or activity of the identifier identifies the specific 
toxin that is present in the test substance. In a preferred embodiment, the cells possess 
at least a detector, a classifier, and an identifier. In a further preferred embodiment, the 
digital data derived from the classifier is used to determine which identifier(s) to 
employ for identifying the specific toxin or group of toxins. 
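The tiered logic of this embodiment (scan every location with the detector, revisit only flagged locations with the classifier, then let the classifier result choose the identifiers) can be sketched in Python as follows. The threshold values, the pathway-to-identifier lookup, and all function names are hypothetical placeholders introduced for illustration; they do not describe the actual instrument software.

```python
DETECTOR_THRESHOLD = 0.25    # assumed fractional change that flags a location
CLASSIFIER_THRESHOLD = 0.25  # assumed fractional change that flags a pathway

# Hypothetical lookup from a perturbed pathway to identifier assays to run next.
IDENTIFIERS_FOR_PATHWAY = {
    "cytoskeleton": ["ADP-ribosylation identifier"],
    "signal transduction": ["protease identifier", "phospholipase identifier"],
}


def flag_locations(detector_changes):
    """Tier 1: every location is scanned; return those whose detector changed enough."""
    return [loc for loc, change in detector_changes.items()
            if abs(change) >= DETECTOR_THRESHOLD]


def perturbed_pathways(classifier_changes):
    """Tier 2: list the pathways whose classifier reporter changed enough."""
    return [path for path, change in classifier_changes.items()
            if abs(change) >= CLASSIFIER_THRESHOLD]


def characterize(detector_changes, read_classifiers):
    """Run the tiers in order; read_classifiers(loc) is called only for flagged locations."""
    report = {}
    for loc in flag_locations(detector_changes):
        pathways = perturbed_pathways(read_classifiers(loc))
        identifiers = sorted({ident
                              for path in pathways
                              for ident in IDENTIFIERS_FOR_PATHWAY.get(path, [])})
        report[loc] = {"pathways": pathways, "identifiers": identifiers}
    return report


if __name__ == "__main__":
    detector = {"A1": 0.05, "A2": 0.60}
    classifier_data = {"A2": {"cytoskeleton": 0.50, "signal transduction": 0.10}}
    print(characterize(detector, lambda loc: classifier_data[loc]))
```

Passing the classifier read as a callable keeps the selective step explicit: locations that the detector does not flag are never imaged a second time.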

As used herein, the phrase "the cells possess one or more luminescent reporter 
molecules" means that the luminescent reporter molecule may be expressed as a 
to cytoplasm translocation, receptor internalization, mitochondrial membrane potential, 
signal intensity, the spectral response of the reporter molecule, phosphorylation, 
intracellular free ion concentration, cell size, cell shape, cytoskeleton organization, 
metabolic processes, cell motility, cell substrate attachment, cell cycle events, and 
organellar structure and function. 

In all of these embodiments, the methods can be operated in both toxin-mimetic 
and toxin-inhibitory modes. 

Such techniques to assess the presence of toxins are useful for methods 
including, but not limited to, monitoring the presence of environmental toxins in test 
samples and for toxins utilized in chemical and biological weapons; and for detecting 
the presence and characteristics of toxins during environmental remediation, drug 
discovery, clinical applications, and during the normal development and manufacturing 
process by virtually any type of industry, including but not limited to agriculture, food 
processing, automobile, electronic, textile, medical device, and petroleum industries. 

We have developed and characterized examples of luminescent cell-based 
reporters, distributed across the 3 sensor classes. The methods disclosed herein can be 
utilized in conjunction with computer databases, and data management, mining, 
retrieval, and display methods to extract meaningful patterns from the enormous data 
set generated by each individual reporter or a combination of reporters in response to 
toxic agents. Such databases and bioinformatics methods include, but are not limited 
to, those disclosed in U.S. Patent Application Nos. 09/437,976, filed November 10, 
1999; 60/145,770 filed July 27, 1999 and U.S. Patent Application Serial No. to be 
assigned, filed February 19, 2000. (98,068-C) 

Any cell type can be used to carry out this aspect of the invention, including 
prokaryotes such as bacteria and archaebacteria, and eukaryotes, such as single celled 
fungi (for example, yeast), molds (for example, Dictyostelium), and protozoa (for 
example, Euglena). Higher eukaryotes, including, but not limited to, avian, amphibian, 
insect, and mammalian cells can also be used. 


Examples of Biosensors 
and changes in mitochondrial membrane potential, intracellular free ion concentration 
detection (for example, Ca2+; H+), general metabolic status, cell cycle timing events, 
and organellar structure and function. 

1. Mitochondrial Potential 

A key to maintenance of cellular homeostasis is a constant ATP energy charge. 
The cycling of ATP and its metabolites ADP, AMP, inorganic phosphate, and solution- 
phase protons is continuously adjusted to meet the catabolic and anabolic needs of the 
cell. Mitochondria are primarily responsible for maintaining a constant energy charge 
throughout the entire cell. To produce ATP from its constituents, mitochondria must 
maintain a constant membrane potential within the organelle itself. Therefore, 
measurement of this electrical potential with specific luminescent probes provides a 
sensitive and rapid readout of cellular stress. 

We have utilized a positively charged cyanine dye, JC-1 (Molecular Probes, 
Eugene, OR), which diffuses into the cell and readily partitions into the mitochondrial 
membrane, for measurement of mitochondrial potential. The photophysics of JC-1 are 
such that when the probe partitions into the mitochondrial membrane and experiences 
an electrical potential >140 mV, the probe aggregates and its spectral response is 
shifted to the red. At membrane potential values <140 mV, JC-1 is primarily 
monomeric and its spectral response is shifted toward the blue. Therefore, the ratio of 
two emission wavelengths (645 nm and 530 nm) of JC-1 partitioned into mitochondria 
provides a sensitive and continuous measure of mitochondrial membrane potential. 

We have been making live cell measurements in a high throughput mode as the 
basis of a generalized indicator of toxic stress. The goal of our initial experiments was 
to determine the ratio of J-aggregates of JC-1 dye to its monomeric form both before 
and after toxic stress. 

Procedure 

1. Cells were plated and cultured up to overnight. 

2. Cells were stained with JC-1 (10 ng/ml) for 30 minutes at 37°C in a CO2 incubator. 

3. Cells were then washed quickly with HBSS at 37°C (2 times, 150 µl/well), the 
toxins were added if required, and the entire plate was scanned in a plate reader. 
The JC-1 monomer was measured optimally with a 485 nm excitation/530 nm 
emission wavelength filter set, and the aggregates were best measured with a 590 
nm excitation/645 nm emission wavelength set. 
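The ratiometric readout described in this procedure can be sketched in Python as follows. The background subtraction step, the function names, and the example intensities are assumptions added for illustration and are not part of the procedure above.

```python
def jc1_ratio(aggregate_645, monomer_530, background_645=0.0, background_530=0.0):
    """Ratio of J-aggregate (645 nm) to monomer (530 nm) emission for one well.

    Background values are optional blank-well readings (an assumed step).
    """
    aggregate = aggregate_645 - background_645
    monomer = monomer_530 - background_530
    if monomer <= 0:
        raise ValueError("monomer signal must be positive after background subtraction")
    return aggregate / monomer


def fractional_ratio_change(ratio_before, ratio_after):
    """Relative change in the 645/530 ratio after treatment; negative means depolarization."""
    return (ratio_after - ratio_before) / ratio_before


if __name__ == "__main__":
    # Illustrative numbers only, not measured data.
    before = jc1_ratio(3000.0, 1000.0)
    after = jc1_ratio(1800.0, 1200.0)
    print(before, after, fractional_ratio_change(before, after))
```

Using a ratio rather than either channel alone makes the readout less dependent on how much dye each well has taken up, which is useful when whole plates are compared before and after toxic stress.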
heat shock proteins HSP27 and HSP70, the heat shock cognate HSC70, and the heat 
shock transcription factor HSF1. Therefore, measurement of cytoplasm to nuclear 
translocation of these proteins (and other stress proteins that translocate from the 
cytoplasm to the nucleus upon a cell stress) will provide a rapid readout of cellular 
stress. 

We have tested the response of an HSP27-GFP biosensor (SEQ ID NO:169-170) in 
two cell lines (BHK and HeLa) using a library of heavy metal chemical compounds as 
biological toxin stimulants to stress the cells. Briefly, cells expressing the HSP27-GFP 
biosensor are plated into 96-well microplates, and allowed to attach. The cells are then 
treated with a panel of cell stress-inducing compounds. Exclusively cytoplasmic 
localization of the fusion protein was found in unstimulated cells. 
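One simple way to turn cytoplasm to nucleus translocation into a number is to compare the mean reporter intensity inside a nuclear region with that in a surrounding cytoplasmic region for each cell. The Python sketch below assumes the pixel intensities for both regions have already been extracted by image segmentation; the function names and the threshold are hypothetical.

```python
from statistics import mean


def nuc_cyto_difference(nuclear_pixels, cytoplasmic_pixels):
    """Mean nuclear intensity minus mean cytoplasmic intensity for one cell."""
    return mean(nuclear_pixels) - mean(cytoplasmic_pixels)


def fraction_responding(cells, threshold):
    """Fraction of cells whose nuclear-minus-cytoplasmic difference exceeds threshold.

    cells: list of (nuclear_pixels, cytoplasmic_pixels) tuples, one per cell.
    threshold: assumed cutoff separating translocated from untranslocated cells.
    """
    if not cells:
        raise ValueError("no cells in field")
    hits = sum(1 for nuc, cyto in cells if nuc_cyto_difference(nuc, cyto) > threshold)
    return hits / len(cells)


if __name__ == "__main__":
    # Illustrative values: first cell is cytoplasmic, second has nuclear accumulation.
    field = [([10, 12], [50, 54]), ([80, 84], [40, 44])]
    print(fraction_responding(field, threshold=0))
```

An unstimulated population with exclusively cytoplasmic reporter would give negative differences for essentially every cell, so the responding fraction stays near zero until a stress causes nuclear accumulation.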

Other similar heat shock protein biosensors (HSP70, HSC70, and HSF1 fused 
to GFP) can be used as detectors, and are shown in SEQ ID NO:171-176. 

Examples of Classifiers: 

This class of sensors detects the presence of, and further classifies, toxins by 
identifying the cellular pathway(s) perturbed by the toxin. As such, this suite of sensors 
can detect and/or classify toxins into broad categories, including but not limited to 
"toxins affecting signal transduction," "toxins affecting the cytoskeleton," and "toxins 
affecting protein synthesis". Either high throughput or high content screening modes 
may be used. Classifiers can comprise compounds including but not limited to tubulin, 
microtubule-associated proteins, actin, actin-binding proteins including but not limited 
to vinculin, α-actinin, actin depolymerizing factor/cofilin, profilin, and myosin; NF-kB, 
IkB, GTP-binding proteins including but not limited to rac, rho, and cdc42; and stress- 
activated protein kinases including but not limited to p38 mitogen-activated protein 
kinase. 

1. Tubulin-cytoskeleton 

The cell cytoskeleton plays a major role in cellular functions and processes, 
such as endo- and exocytosis, vesicle transport, and mitosis. Cytoskeleton-affecting 
2. NF-kB 

NF-kB is cytoplasmic at basal levels of stimulation, but upon insult translocates 
to the nucleus where it binds specific DNA response elements and activates 
transcription of a number of genes. Translocation occurs when IkB is degraded by the 
proteasome in response to specific phosphorylation and ubiquitination events. IkB 
normally retains NF-kB in the cytoplasm via direct interaction with the protein, and 
masking of the NLS sequence of NF-kB. Therefore, although not the initial or defining 
event of the whole signal cascade, NF-kB translocation to the nucleus can serve as an 
indicator of cell stress. 

We have generated an NF-kB-GFP chimera for analysis in live cells. This was 
accomplished using standard polymerase chain reaction techniques using a 
characterized NF-kB p65 cDNA purchased from Invitrogen (Carlsbad, CA) fused to an 
EYFP PCR amplimer that was obtained from Clontech Laboratories (Palo Alto, CA). 
The resulting chimera is shown in SEQ ID NO:177-178. The two PCR products were 
ligated into a eukaryotic expression vector designed to produce the chimeric protein at 
high levels using the ubiquitous CMV promoter. 

NF-kB immunolocalization 

For further studies, we characterized endogenous NF-kB activation by 
immunolocalization in toxin treated cells. The NF-kB antibodies used in this study 
were purchased from Santa Cruz Biotechnology, Inc. (Santa Cruz, CA), and secondary 
antibodies are from Molecular Probes (Eugene, OR). 

For the 3T3 and SNB19 cell types, we determined the effective concentrations 
that yield response levels of 50% of the maximum (EC50), expressed in units of mass 
per volume (ng/ml) and units of molarity. Based on molecular weights of 17 kD for 
both TNFα and IL-1α, the EC50 levels for these two compounds with 3T3 and SNB19 
cell types are given in units of molarity in Table 1. Our results demonstrated 
reproducibility of the relative responses from zero to maximum dose, but from sample 
to sample there have been occasional shifts in the baseline intensities of the response at 
zero concentration. 
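The conversion between the two EC50 units follows directly from the 17 kD molecular weight. The Python sketch below shows the arithmetic; the 1 ng/ml input is an illustrative value, not an EC50 reported here, and the function names are hypothetical.

```python
def ng_per_ml_to_molar(conc_ng_per_ml, mw_daltons):
    """Convert a mass concentration in ng/ml to mol/l for a given molecular weight."""
    grams_per_liter = conc_ng_per_ml * 1e-9 * 1e3  # ng -> g, per ml -> per liter
    return grams_per_liter / mw_daltons


def molar_to_picomolar(conc_molar):
    """Express a molar concentration in pM."""
    return conc_molar * 1e12


if __name__ == "__main__":
    mw = 17_000  # 17 kD, as stated for both cytokines
    molar = ng_per_ml_to_molar(1.0, mw)
    print(f"1 ng/ml = {molar:.3e} M = {molar_to_picomolar(molar):.1f} pM")
```

For a 17 kD protein, 1 ng/ml corresponds to about 59 pM, so EC50 values in the ng/ml range translate to picomolar to low nanomolar concentrations.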
MAPK p38 lies in a pathway that is a cascade of kinases. Thus, p38 is a 
substrate of one or more kinases, and it acts to phosphorylate one or more substrates in 
time and space within the living cell. 

The assay we present here measures, as one of its parameters, p38 activation 
using immunolocalization of the phosphorylated form of p38 in toxin-treated cells. The 
assay was developed to be flexible enough to include the simultaneous measurement of 
other parameters within the same individual cells. Because the signal transduction 
pathway mediated by the transcription factor NF-kB is also known to be involved in the 
cell stress response, we included the activation of NF-kB as a second parameter in the 
same assay. 

Our experiments demonstrate that an immunofluorescence approach can be used to 
measure p38 MAPK activation either alone or in combination with NF-kB activation in 
the same cells. Multiple cell types, model toxins, and antibodies were tested, and 
significant stimulation of both pathways was measured in a high-content mode. The 
phospho-p38 antibodies used in this study were purchased from Sigma Chemical 
Company (St. Louis, MO). We report that at least two cell stress signaling pathways 
can not only be measured simultaneously, but are differentially responsive to classes of 
model toxins. Figure 36 shows the differential response of the p38 MAPK and NF-kB 
pathways across three model toxins and two different cell types. Note that when added 
alone, three of the model toxins (IL-1α, TNFα, and anisomycin) can be differentiated 
by the two assays as activators of specific pathways. 

IkB chimera 

IkB degradation is the key event leading to nuclear translocation of NF-kB and 
activation of the NF-kB-mediated stress response. We have chosen this sensor to 
complement the NF-kB sensor as a classifier in a high-throughput mode: the 
measurement of loss of signal due to degradation of the IkB-GFP fusion protein 
requires no spatial resolution within individual cells, and as such we envision IkB 
degradation measurements being made rapidly on an entire cell substrate. 

This biosensor is based on fusion of the first 60 amino acids of IkB to the 
Fret25 variant of GFP (SEQ ID NO:179-180). This region of IkB contains all the regulatory 
relatively low anisotropy, which can be readily measured with an imaging system. In 
another embodiment, actin can be labeled with a polarity-sensitive fluorescent reagent 
that reports changes in actin conformation through spectral shifts of the attached 
reagent. That is, toxin treatment will induce a conformational change in intracellular 
actin such that a ratio of two fluorescence wavelengths will provide a measure of actin 
ADP-ribosylation. 

Cytotoxic phospholipases - Several gram-positive bacterial species produce 
cytotoxic phospholipases. For example, Clostridium perfringens produces a 
phospholipase C specific for the cleavage of phosphoinositides. These 
phosphoinositides (e.g., inositol 1,4,5-trisphosphate) induce the release of calcium ions 
from intracellular organelles. An assay that can be conducted as either high-content or 
high-throughput can be constructed to measure the release of calcium ions using 
fluorescent reagents that have altered spectral properties when complexed with the 
metal ion. Therefore, a direct consequence of the action of a phospholipase C based 
toxin can be measured as a change in cellular calcium ion concentration. 

Exfoliative toxins - These toxins are produced by several Staphylococcal 
species and can consist of several serotypes. A specific identifier for these toxins can be 
constructed by measuring the morphological changes in their target organelle, the 
desmosome, which occurs at the junctions between cells. The exfoliative toxins are 
known to change the morphology of the desmosomes into two smaller components 
called hemidesmosomes. In the high-content assay for exfoliative toxins, epithelial cells 
whose desmosomes are luminescently labeled are subjected to image analysis. A 
method that detects the morphological change between desmosomes and 
hemidesmosomes is used to quantify the activity of the toxins on the cells. 

Most of these identifiers can be used in high throughput assays requiring no 
spatial resolution, as well as in high content assays. 

Several biological threat agents act as specific proteases, and thus we have 
focused on the development of fluorescent protein biosensors that report the proteolytic 
cleavage of specific amino acid sequences found within the target proteins. 

A number of such protease biosensors (including FRET biosensors) are 
disclosed above, such as the caspase biosensors, anthrax, tetanus, Botulinum, and the 
CLAIMS 

We claim: 

1. An automated method for cell based toxin characterization comprising 

-providing an array of locations containing cells to be treated with a test 
substance, wherein the cells possess at least a first luminescent reporter molecule 
comprising a detector and a second luminescent reporter molecule selected from the 
group consisting of a classifier or an identifier, 

-contacting the cells with the test substance either before or after possession of 
the first and second luminescent reporter molecules by the cells; wherein the 
localization, distribution, structure, or activity of the first and second luminescent 
reporter molecule is modified when the cell is contacted with the toxin, 

-imaging or scanning multiple cells in each of the locations containing multiple 
cells to obtain luminescent signals from the detector; 

-converting the luminescent signals from the detector into digital data; 

-utilizing the digital data from the detector to automatically measure the 
localization, distribution, or activity of the detector on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the detector indicates the 
presence of a toxin in the test substance; 

-selectively imaging or scanning the locations containing cells that were 
contacted with test sample indicated to have a toxin in it to obtain luminescent signals 
from the second reporter molecule; 

-converting the luminescent signals from the second luminescent reporter 
molecule into digital data; 

-utilizing the digital data from the second luminescent reporter molecule to 
automatically measure the localization, distribution, or activity of the classifier or 
identifier on or in the cell, wherein a change in the localization, distribution, structure 
or activity of the classifier identifies a cell pathway that is perturbed by the toxin 
present in the test substance, or wherein a change in the localization, distribution, 
structure or activity of the identifier identifies the specific toxin or group of toxins that 
are present in the test substance. 
-utilizing the digital data from the identifier to automatically measure the 
localization, distribution, or activity of the identifier on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the identifier identifies the 
specific toxin or group of toxins that is present in the test substance. 

4. The method of claim 3 wherein the digital data derived from the classifier is 
used to select an appropriate identifier for identification of the specific toxin or group 
of toxins. 

5. The method of any one of claims 1-4 wherein the detector comprises a molecule 
selected from the group consisting of heat shock proteins and compounds that respond 
to changes in mitochondrial membrane potential, intracellular free ion concentration, 
cytoskeletal organization, general metabolic status, cell cycle timing events, and 
organellar structure and function. 

6. The method of any one of claims 1-5 wherein the classifier comprises a molecule 
selected from the group consisting of tubulin, microtubule-associated proteins, actin, 
actin-binding proteins, NF-kB, IkB, and stress-activated kinases. 

7. The method of any one of claims 1-6 wherein the cell pathway is selected from 
the group consisting of cell stress pathways, cell metabolic pathways, cell signaling 
pathways, cell growth pathways, and cell division pathways. 

8. The method of claim 1, wherein the second luminescent reporter molecule is an 
identifier, and the identifier identifies a toxin or group of toxins selected from the group 
consisting of proteases, ADP-ribosylating toxins, cytotoxic phospholipases, and 
exfoliative toxins. 

9. The method of any one of claims 3-7, wherein the identifier identifies a toxin or 
group of toxins selected from the group consisting of proteases, ADP-ribosylating 
toxins, cytotoxic phospholipases, and exfoliative toxins. 
17. A computer readable storage medium comprising a program containing a set of 
instructions for causing a cell screening system to execute the method of any one of 
claims 1-16, wherein the cell screening system comprises an optical system with a stage 
adapted for holding a plate containing cells, a means for moving the stage or the optical 
system, a digital camera, a means for directing light emitted from the cells to the digital 
camera, and a computer means for receiving and processing the digital data from the 
digital camera. 

18. A kit for cell based toxin detection comprising: 

(a) at least one reporter molecule, wherein the localization, distribution, 
structure, or activity of the reporter molecule is modified when the cell is contacted 
with a toxin; 

(b) instructions for using the reporter molecule to carry out the method of 
any one of claims 1-16 to detect toxins in a test substance. 

19. The kit of claim 18 further comprising the computer readable storage medium 
of claim 17. 

20. An automated method for cell based toxin characterization comprising 

-providing a first array of locations containing cells to be treated with a test 
substance, wherein the cells possess at least a first luminescent reporter molecule 
comprising a reporter molecule selected from the group consisting of detectors and 
classifiers; 

-contacting the cells with the test substance either before or after possession of 
the first luminescent reporter molecule by the cells; wherein the localization, 
distribution, structure, or activity of the first luminescent reporter molecule is modified 
when the cell is contacted with the toxin, 

-imaging or scanning multiple cells in each of the locations containing multiple 
cells to obtain luminescent signals from the detector; 

-converting the luminescent signals from the detector into digital data; 
FIGURE 25 

Mitochondrial Mass, Potential Data 

991007.GML.Ap.DR1.20x_cs1: Mitochondrial Mass and Potential in 24 hr Staurosporine 
treated BHK. 
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SIGNAL SEQUENCES 



Columns: EPITOPE; SEQUENCE; SEQ ID NO; REFERENCE

FLAG epitope
    Sequence: 5'GACTACAAAGACGACGACGACAAA
              AA Seq: DYKDDDDK
    Reference: Kasir, et al., 1999. J Biol Chem. 274:24873-80.

HA epitope
    Sequence: 5'TACCCATACGACGTACCAGACTACGCA
              AA Seq: YPYDVPDYA
    SEQ ID NO: 37, 38
    Reference: Smith, et al., 1999. J Biol Chem. 274:19894-900.

KT3 epitope
    Sequence: 5'CCACCAGAACCAGAAACA
              AA Seq: PPEPET
    SEQ ID NO: 39, 40
    Reference: MacArthur and Walter. 1984. J Virol. 52:483-91.

Myc epitope
    Sequence: 5'GCAGAAGAACAAAAATTAATAAGCGAAGAAGACTTA
              AA Seq: AEEQKLISEEDL
    SEQ ID NO: 41, 42
    Reference: Gosney, et al., 1990. Anticancer Res. 10:623-8.


EYFP: SEQ ID NO: 43 (Nucleic acid); SEQ ID NO:44 (Amino acid) 

MVSK GEEL FTGV VPIL VELD
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC

GDVN GHKF SVSG EGEG DATY
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC

GKLT LKFI CTTG KLPV PWPT
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC

LVTT FGYG LQCF ARYP DHMK
CTCGTGACCACC TTCGGCTACGGC CTGCAGTGCTTC GCCCGCTACCCC GACCACATGAAG

QHDF FKSA MPEG YVQE RTIF
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC

FKDD GNYK TRAE VKFE GDTL
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG

VNRI ELKG IDFK EDGN ILGH
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGGGCAC

KLEY NYNS HNVY IMAD KQKN
AAGCTGGAGTAC AACTACAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC

GIKV NFKI RHNI EDGS VQLA
GGCATCAAGGTG AACTTCAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC

DHYQ QNTP IGDG PVLL PDNH
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC

YLSY QSAL SKDP NEKR DHMV
TACCTGAGCTAC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC

LLEF VTAA GITL GMDE LYK
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG



EGFP: SEQ ID NO:45 (Nucleic acid); SEQ ID NO:46 (Amino acid) 

MVSK GEEL FTGV VPIL VELD
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC

GDVN GHKF SVSG EGEG DATY
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC

GKLT LKFI CTTG KLPV PWPT
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC

LVTT LTYG VQCF SRYP DHMK
CTCGTGACCACC CTGACCTACGGC GTGCAGTGCTTC AGCCGCTACCCC GACCACATGAAG

QHDF FKSA MPEG YVQE RTIF
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC

FKDD GNYK TRAE VKFE GDTL
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG

VNRI ELKG IDFK EDGN ILGH
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGGGCAC

KLEY NYNS HNVY IMAD KQKN
AAGCTGGAGTAC AACTACAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC

GIKV NFKI RHNI EDGS VQLA
GGCATCAAGGTG AACTTCAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC

DHYQ QNTP IGDG PVLL PDNH
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC

YLST QSAL SKDP NEKR DHMV
TACCTGAGCACC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC

LLEF VTAA GITL GMDE LYK
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG



EBFP: SEQ ID NO:47 (Nucleic acid); SEQ ID NO:48 (Amino acid)

MVSK GEEL FTGV VPIL VELD
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC

GDVN GHKF SVSG EGEG DATY
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC

GKLT LKFI CTTG KLPV PWPT
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC

LVTT LTHG VQCF SRYP DHMK
CTCGTGACCACC CTGACCCACGGC GTGCAGTGCTTC AGCCGCTACCCC GACCACATGAAG

QHDF FKSA MPEG YVQE RTIF
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC

FKDD GNYK TRAE VKFE GDTL
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG

VNRI ELKG IDFK EDGN ILGH
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGGGCAC

KLEY NFNS HNVY IMAD KQKN
AAGCTGGAGTAC AACTTCAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC

GIKV NFKI RHNI EDGS VQLA
GGCATCAAGGTG AACTTCAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC

DHYQ QNTP IGDG PVLL PDNH
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC

YLST QSAL SKDP NEKR DHMV
TACCTGAGCACC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC

LLEF VTAA GITL GMDE LYK
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG



ECFP: SEQ ID NO:49 (Nucleic acid); SEQ ID NO:50 (Amino acid)

MVSK GEEL FTGV VPIL VELD
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC

GDVN GHKF SVSG EGEG DATY
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC

GKLT LKFI CTTG KLPV PWPT
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC

LVTT LTWG VQCF SRYP DHMK
CTCGTGACCACC CTGACCTGGGGC GTGCAGTGCTTC AGCCGCTACCCC GACCACATGAAG

QHDF FKSA MPEG YVQE RTIF
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC

FKDD GNYK TRAE VKFE GDTL
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG

VNRI ELKG IDFK EDGN ILGH
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGGGCAC

KLEY NYIS HNVY ITAD KQKN
AAGCTGGAGTAC AACTACATCAGC CACAACGTCTAT ATCACCGCCGAC AAGCAGAAGAAC

GIKA NFKI RHNI EDGS VQLA
GGCATCAAGGCC AACTTCAAGATC CGGCACAACATC GAGGACGGCAGC GTGCAGCTCGCC

DHYQ QNTP IGDG PVLL PDNH
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC

YLST QSAL SKDP NEKR DHMV
TACCTGAGCACC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC

LLEF VTAA GITL GMDE LYK
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG



Pred25: SEQ ID NO:51 (Nucleic acid); SEQ ID NO:52 (Amino acid)

MASK GEEL FTGV VPIL VELD
ATGGCTAGCAAA GGAGAAGAACTC TTCACTGGAGTT GTCCCAATTCTT GTTGAATTAGAT

GDVN GHKF SVSG EGEG DATY
GGTGATGTTAAC GGCCACAAGTTC TCTGTCAGTGGA GAGGGTGAAGGT GATGCAACATAC

GKLT LKFI CTTG KLPV PWPT
GGAAAACTTACC CTGAAGTTCATC TGCACTACTGGC AAACTGCCTGTT CCATGGCCAACA

LVTT LCYG VQCF SRYP DHMK
CTAGTCACTACT CTGTGCTATGGT GTTCAATGCTTT TCAAGATACCCG GATCATATGAAA

RHDF FKSA MPEG YVQE RTIF
CGGCATGACTTT TTCAAGAGTGCC ATGCCCGAAGGT TATGTACAGGAA AGGACCATCTTC

FKDD GNYK TRAE VKFE GDTL
TTCAAAGATGAC GGCAACTACAAG ACACGTGCTGAA GTCAAGTTTGAA GGTGATACCCTT

VNRI ELKG IDFK EDGN ILGH
GTTAATAGAATC GAGTTAAAAGGT ATTGACTTCAAG GAAGATGGCAAC ATTCTGGGACAC

KLEY NYNS HNVY IMAD KQKN
AAATTGGAATAC AACTATAACTCA CACAATGTATAC ATCATGGCAGAC AAACAAAAGAAT

GIKV NFKT RHNI EDGS VQLA
GGAATCAAAGTG AACTTCAAGACC CGCCACAACATT GAAGATGGAAGC GTTCAACTAGCA

DHYQ QNTP IGDG PVLL PDNH
GACCATTATCAA CAAAATACTCCA ATTGGCGATGGC CCTGTCCTTTTA CCAGACAACCAT

YLST QSAL SKDP NEKR DHMV
TACCTGTCCACA CAATCTGCCCTT TCGAAAGATCCC AACGAAAAGAGA GACCACATGGTC

LLEF VTAA GITH GMDE LYN*
CTTCTTGAGTTT GTAACAGCTGCT GGGATTACACAT GGCATGGATGAA CTGTACAACTAG
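
The fluorescent protein listings above differ from one another at only a few residues. A short sketch of how such differences could be enumerated is given below; it is an illustration rather than part of the disclosure. The function name `substitutions` is hypothetical, the nine-residue segments are taken from the fourth row of each listing as read here, and positions are counted within the segment only (no residue numbering scheme is implied).

```python
def substitutions(reference: str, variant: str) -> list[tuple[int, str, str]]:
    """List (position, reference_residue, variant_residue) for two
    pre-aligned, equal-length protein segments. Positions are 1-based
    within the segment."""
    if len(reference) != len(variant):
        raise ValueError("segments must be aligned to the same length")
    return [
        (i, r, v)
        for i, (r, v) in enumerate(zip(reference, variant), start=1)
        if r != v
    ]


# Chromophore-region segments as read from the listings above
# (assumed transcriptions; only the segment, not the whole protein).
EGFP = "LTYGVQCFS"
VARIANTS = {
    "EYFP": "FGYGLQCFA",
    "EBFP": "LTHGVQCFS",
    "ECFP": "LTWGVQCFS",
}

if __name__ == "__main__":
    for name, segment in VARIANTS.items():
        print(name, substitutions(EGFP, segment))
```

Running the comparison over full-length sequences would work the same way, provided the sequences are first aligned to equal length.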
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2. PROTEASE RECOGNITION SITES 



Columns: Protease; Substrate Source; Recognition Site; SEQ ID; Reference

Caspase-1, 4, 5
    Substrate source: peptide library
    Recognition site: 5'(TGG,TTA)GAACATGAC
                      AA Seq: (W,L)EHD/
    SEQ ID: 53, 54
    Reference: Thornberry et al., 1997, J. Biol. Chem. 272:17907

proCaspase-1
    Substrate source: peptide library
    Recognition site: 5'TGGTTTAAAGAC
                      AA Seq: WFKD/
    SEQ ID: 55, 56
    Reference: Thornberry et al., 1997, J. Biol. Chem. 272:17907

Caspase-2
    Substrate source: peptide library
    Recognition site: 5'GACGAACACGAC
                      AA Seq: DEHD/
    SEQ ID: 57, 58
    Reference: Thornberry et al., 1997, J. Biol. Chem. 272:17907

Caspase-3, 7
    Substrate source: PARP
    Recognition site: 5'GACGAAGTTGAC
                      AA Seq: DEVD/
    SEQ ID: 59, 60
    Reference: Denote, et al., 1997. Biochem Mol Biol Int. 43:755-41;
               Thornberry et al., 1997, J. Biol. Chem. 272:17907

proCaspase-3
    Substrate source: Caspase-3
    Recognition site: 5'ATAGAAACAGAC
                      AA Seq: IETD/
    SEQ ID: 61, 62
    Reference: Tewari, M., et al., 1995. Cell. 81:801-9.

proCaspase-4, 5
    Substrate source: peptide library
    Recognition site: 5'TGGGTAAGAGAC
                      AA Seq: WVRD/
    SEQ ID: 63, 64
    Reference: Thornberry, N.A. et al., 1997, J. Biol. Chem. 272:17907-17911

Caspase 6
    Substrate source: Lamin A
    Recognition site: 5'GTAGAAATAGAC
                      AA Seq: VEID/
    SEQ ID: 65, 66
    Substrate source: peptide library
    Recognition site: 5'GTAGAACACGAC
                      AA Seq: VEHD/
    SEQ ID: 67, 68
    Reference: Nakajima and Sado. 1993. Biochim Biophys Acta. 1171:311-4;
               Thornberry et al., 1997, J. Biol. Chem. 272:17907

proCaspase 6
    Substrate source: Caspase-6
    Recognition site: 5'ACAGAAGTAGAC
                      AA Seq: TEVD/
    SEQ ID: 69, 70
    Reference: Fernandes-Alnemri, et al., 1994. J Biol Chem. 269:30761-4.

proCaspase-7
    Substrate source: peptide library
    Recognition site: 5'ATACAAGCAGAC
                      AA Seq: IQAD/
    SEQ ID: 71, 72
    Reference: Thornberry, N.A. et al., 1997, J. Biol. Chem. 272:17907-17911

Caspase 8
    Substrate source: peptide library
    Recognition site: 5'GTAGAAACAGAC
                      AA Seq: VETD/
    SEQ ID: 73, 74
    Reference: Muzio, M., et al., 1996. Cell. 85:817-27;
               Fernandes-Alnemri, et al., 1996. Proc Natl Acad Sci U S A. 93:7464-9;
               Thornberry et al., 1997, J. Biol. Chem. 272:17907

proCaspase-8
    Substrate source: Caspase-8
    Recognition site: 5'TTAGAAACAGAC
                      AA Seq: LETD/
    SEQ ID: 75, 76
    Reference: Muzio, M., et al., 1996. Cell. 85:817-27;
               Fernandes-Alnemri, et al., 1996. Proc Natl Acad Sci U S A. 93:7464-9;
               Thornberry et al., 1997, J. Biol. Chem. 272:17907

Caspase 9
    Substrate source: peptide library
    Recognition site: 5'TTAGAACACGAC
                      AA Seq: LEHD/
    SEQ ID: 77, 78
    Reference: Thornberry, N.A. et al., 1997, J. Biol. Chem. 272:17907-17911

proCaspase 9
    Substrate source: Caspase-9
    Recognition site: CCCGAACCCGAC
                      PEPD
    SEQ ID: 79
    Reference: Thornberry, N.A. et al., 1997, J. Biol. Chem. 272:17907-17911


HIV protease
    Recognition site: 5'AGCCAAAATTAC
                      AA Seq: SQNY/
    SEQ ID: 81, 82
    Recognition site: 5'CCAATAGTACAA
                      AA Seq: PIVQ/
    SEQ ID: 83, 84
    Reference: Matayoshi, et al., 1990. Science. 247:954-8.

Adenovirus endopeptidase
    Recognition site: 5'AUGTTTGGAGGA
                      AA Seq: MFGG/
    SEQ ID: 85, 86
    Recognition site: 5'GCAAAAAAAAGA
                      AA Seq: AKKR/
    SEQ ID: 87, 88
    Reference: Weber and Tihanyi. 1994. Methods Enzymol. 244:595-604.

b-Secretase
    Substrate source: Amyloid precursor protein
    Recognition site: 5'GTAAAAAUG
                      AA Seq: VKM/
    SEQ ID: 89, 90
    Recognition site: 5'GACGCAGAATTC
                      AA Seq: DAEF/
    SEQ ID: 91, 92
    Reference: Hardy et al., 1994, in Amyloid Protein Precursor in
               Development, Aging, and Alzheimer's Disease, ed. C.L.
               Masters et al., pp. 190-198.

Cathepsin D
    Recognition site: 5'AAACCAGCATTATTC
                      AA Seq: KPALF
    SEQ ID: 93, 94
    Recognition site: 5'TTCAGATTA
                      AA Seq: FRL/
    SEQ ID: 95, 96
    Reference: Dunn, et al., 1998. Adv Exp Med Biol. 436:133-8.

Matrix Metalloproteases
    Recognition site: 5'GGACCATTAGGACCA
                      AA Seq: GPLGP
    SEQ ID: 97, 98
    Reference: Bouvier et al., 1993; Garbett et al., 1999; Hill and Sakanari, 1997;
               Kojima et al., 1998; Tyagi et al., 1995; Wilhelm et al., 1993;
               Williams and Auld, 1986; Haugland, R., Handbook of
               Fluorescent Probes and Research Chemicals, 7th ed.

(protease name missing in source)
    Substrate source: peptide library
    Recognition site: 5'ATAGAACCAGAC
                      AA Seq: IEPD/
    SEQ ID: 99, 100
    Reference: Thornberry et al., 1997, J. Biol. Chem. 272:17907

Anthrax protease
    Substrate source: MEK1
    Recognition site: 5'ATGCCCAAGAAGAAGCCGACGCCCATCCAGCTGAACCC
    SEQ ID: 101
                      AA Seq: MPKKKPTPIQLN
    SEQ ID: 102
    Reference: Vitale et al., (1998) Biochem Biophys Res Commun 248 (3), 706-711

    Recognition site: 5'ATGCTGGCCCGGAGGAAGCCGGTGCTGCCGGCGCTCACCATCAACCC
                      AA Seq: MLARRKPVLPALTIN
    SEQ ID: 103, 104
    Reference: Vitale et al., (1998) Biochem Biophys Res Commun 248 (3), 706-711

tetanus/botulinum
    Substrate source: cellubrevin
    Recognition site: 5'GCCTCGCAGTTTGAAACA
                      AA Seq: ASQFET
    SEQ ID: 105, 106
    Reference: McMahon et al., Nature 364:346-349; Martin et al., J. Cell Biol. In press

    Substrate source: synaptobrevin/VAMP3
    Recognition site: 5'GCTTCTCAATTTGAAACG
                      AA Seq: ASQFET
    SEQ ID: 107, 108
    Reference: Schiavo et al., (1992) Nature 359, 832-5

Botulinum neurotoxin A
    Substrate source: SNAP-25
    Recognition site: 5'GCCAACCAACGTGCAACA
                      AA Seq: ANQ/RAT
    SEQ ID: 109, 110
    Reference: Zhao, et al. Gene 145 (2), 313-314 (1994)

Botulinum neurotoxin B
    Substrate source: VAMP
    Recognition site: 5'GCTTCTCAATTTGAAACG
                      AA Seq: ASQ/FET
    SEQ ID: 111, 112

Botulinum neurotoxin C
    Substrate source: Syntaxin
    Recognition site: 5'ACGAAAAAAGCTGTGAAA
                      AA Seq: TKK/AVK
    SEQ ID: 113, 114
    Reference: Martin et al., J. Leukoc. Biol. 65 (3), 397-406 (1999)

Botulinum neurotoxin D
    Substrate source: VAMP
    Recognition site: 5'GACCAGAAGCTCTCTGAG
                      AA Seq: DQK/LSE
    SEQ ID: 115, 116

Botulinum neurotoxin E
    Substrate source: SNAP-25
    Recognition site: 5'ATCGACAGGATCATGGAG
                      AA Seq: IDR/IME
    SEQ ID: 117, 118

Botulinum neurotoxin F
    Substrate source: VAMP
    Recognition site: 5'AGAGACCAGAAGCTCTCT
                      AA Seq: RDQ/KLS
    SEQ ID: 119, 120

Botulinum neurotoxin G
    Substrate source: VAMP
    Recognition site: 5'ACGAGCGCAGCCAAGTTG
                      AA Seq: TSA/AKL
    SEQ ID: 121, 122
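
In the recognition-site entries above, the slash marks the scissile bond, for example `DEVD/` or `ANQ/RAT`. The sketch below illustrates how a construct containing such a site would be split into fragments. It is a hypothetical helper (`cleave` is not a name from the source) and it assumes exact motif matching, which real proteases do not strictly require.

```python
def cleave(protein: str, site: str) -> list[str]:
    """Split `protein` at every occurrence of a recognition site.

    `site` uses the table's notation: residues with a '/' marking the
    scissile bond, e.g. "DEVD/" (cut after the motif) or "ANQ/RAT"
    (cut inside the motif). Matching is exact, which is a simplification.
    """
    offset = site.index("/")        # residues upstream of the cut
    motif = site.replace("/", "")
    fragments, start = [], 0
    pos = protein.find(motif)
    while pos != -1:
        cut = pos + offset
        fragments.append(protein[start:cut])
        start = cut
        pos = protein.find(motif, pos + 1)
    fragments.append(protein[start:])
    return [f for f in fragments if f]


if __name__ == "__main__":
    # Linker region with two DEVD sites (illustrative input only).
    print(cleave("GAGADEVDGAGADEVDGAMS", "DEVD/"))
    print(cleave("XXANQRATYY", "ANQ/RAT"))
```

Each returned fragment ends at a cut site, so the first and last fragments correspond to the two halves of a biosensor that are released from one another on cleavage.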
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3. PRODUCT/REACTANT TARGET SEQUENCES 



Columns: Target; Target Source; Target domain (Product or Reactant); SEQ ID NO; Reference

Cytoplasm/cytoskeleton
    Target source: Annexin II
    Target domain: 5'ATGTCTACTGTCCACGAAATCCTGTGCAAG
                   CTCAGCTTGGAGGGTGTTCATTCTACACCCCC
                   AAGTGCC3'
                   (Amino acid seq: MSTVHEILCKLSLEGVHSTPPSA)
    SEQ ID NO: 123, 124
    Reference: Eberhard, et al., 1997. Mol Biol Cell. 8:293a.

Inner surface of plasma membrane
    Target source: farnesylation
    Target domain: 5'AUGGGATCTACATTAAGCGCAGAAGACAA
                   AGCAGCAGTAGAAAGAAGCAAAAUGATAGA
                   CAGAAACTTATTAAGAGAAGACGGAGAAAA
                   AGCTGCTAGA3'
                   (AA seq: MGCTLSAEDKAAVERSKMIDRNLLREDGEKAAR)
    SEQ ID NO: 125, 126
    Reference: Ferruccio G. et al., J. Biol. Chem. 274, 5843-5850, 1999

Nucleus
    Target source: NFkB p50
    Target domain: 5'AGAAGGAAACGACAAAAG
                   (AA seq: RRKRQK)
    SEQ ID NO: 127, 128
    Reference: Henkel T. et al., Cell 68, 1121-1133, 1992

Nucleolus
    Target source: NOLP
    Target domain: 5'AGAAAACGTATACGTACTTACCTCAAGTCC
                   TGCAGGCGGATGAAAAGAAGTGGTTTTGAGA
                   TGTCTCGACCTATTCCTTCCCACCTTACT
                   (AA seq: RKRIRTYLKSCRRMKRSGFEMSRPIPSHLT)
    SEQ ID NO: 129, 130
    Reference: Ueki, et al., 1998. Biochem Biophys Res Commun.

Mitochondria
    Target source: cytochrome c oxidase
    Target domain: 5'ATGTCCGTCCTGACGCCGCTGCTGCTGCGG
                   GGCTTGACAGGCTCGGCCCGGCGGCTCCCAG
                   TGCCGCGCGCTAAGATCCATTCGTTG
                   (AA Seq: MSVLTPLLLRGLTGSARRLPVPRAKIHSL)
    SEQ ID NO: 131, 132
    Reference: Rizzuto, et al., 1989. J Biol Chem. 264:10595-600.

Nuclear Envelope
    Target source: ODV-E66 & ODV-E25
    Target domain: 5'AUGAGCATTGTTTTAATAATTGTTATTTGGA
                   [illegible]AATATGTTTTTTATATTTAAGCAACA
                   GCAAAGATCCCAGAGTACCAGTTGAATTAAUG
                   (AA Seq: MSIVLIIVIVVIFLICFLYLSNSKDPRVPVELM)
    SEQ ID NO: 133, 134
    Reference: Hong, T., et al., PNAS, 94, 4050-4055, 1997

Golgi
    Target source: Calreticulin
    Target domain: 5'ATGAGGCTTCGGGAGCCGCTCCTGAGCGGC
                   AGCGCCGCGATGCCAGGCGCGTCCCTACAGC
                   GGGCCTGCCGCCTGCTCGTGGCCGTCTGCGCT
                   CTGCACCTTGGCGTCACCCTCGTTTACTACCT
                   GGCTGC[illegible]TGCGACCTGAGGCGCCTGCCCCAA
                   CTGGTCGGAGTCTCCACACCGCTGCAGGGCG
                   [illegible]CTCGAACAGTGCCGC[illegible]GCCAT[illegible]
                   CTCCGGGGAGCTCCGGACCGGAGGGGCC
                   (AA Seq: MRLREPLLSGSAAMP
                   GASLQRACRLLVAVCALHLGVTL
                   VYYLAGRDLSRLPQLVGVSTPLQG
                   GSNSAAAIGQSSGELRTGGA)
    SEQ ID NO: 135, 136
    Reference: Fliegel, L., et al., J. Biol. Chem. 264, 21522-21528, 1989.

Endoplasmic reticulum
    Target source: D-AKAP1
    Target domain: 5'GAAACAATAAGACCTATAAGAAGATGTAGT
                   ACATTTACATCTACAGACAGCAAAAUGGCAA
                   TTCAATTAAGATCTCCCTTTCCATTAGCATTA
                   CCAGGAAUGTTAGCTTTATTAGGATGGTGGT
                   GGTTTTTCAGTAGAAAAAAA
                   (AA Seq: ETIRPIRIRRCSYFTSTDSKM
                   AIQLRSPFPLALPGMLALLGWWW
                   FFSRKK)
    SEQ ID NO: 137, 138
    Reference: Huang, L. et al., J. Cell. Biol. 145, 951-959, 1999

Nuclear Export
    Target source: MEK1
    Target domain: 5'GCCTTGCAGAAGAAGCTGGAGGAGCT
                   AGAGCTTGATGAG
                   (AA SEQ: ALQKKLEELELDE)
    SEQ ID NO: 139, 140
    Reference: Fukuda, (1997) J. Biol. Chem 272, 51, 32642-32646

FIGURE 29C

Size exclusion
    Target source: PROJ domain of MAP4
    Target domain:
5'GCCGACCTCAGTCTTGTGGATGCGTTGACA
GAACCACCTCCAGAAATTGAGGGAGAAATAA
AGCGAGACTTCATGGCTGCGCTGGAGGCAGA
GCCCTATGATGACATCGTGGGAGAAACTGTG
GAGAAAACTGAGTTTATTCCTCTCCTGGATGG
TGATGAGAAAACCGGGAACTCAGAGTCCAAA
AAGAAACCCTGCTTAGACACTAGCCAGGTTG
AAGGTATCCCATCTTCTAAACCAACACTCCTA
GCCAATGGTGATCATGGAATGGAGGGGAATA
ACACTGCAGGGTCTCCAACTGACTTCCTTGAA
GAGAGAGTGGACTATCCGGATTATCAGAGCA
GCCAGAACTGGCCAGAAGATGCAAGCTTTTG
TTTCCAGCCTCAGCAAGTGTTAGATACTGACC
AGGCTGAGCCCTTTAACGAGCACCGTGATGA
TGGTTTGGCAGATCTGCTCTTTGTCTCCAGTG
GACCCACGAACGCTTCTGCATTTACAGAGCG
AGACAATCCTTCAGAAGACAGTTACGGTATG
CTTCCCTGTGACTCATTTGCTTCCACGGCTGT
TGTATCTCAGGAGTGGTCTGTGGGAGCCCCA
AACTCTCCATGTTCAGAGTCCTGTGTCTCCCC
AGAGGTTACTATAGAAACCCTACAGCCAGCA
ACAGAGCTCTCCAAGGCAGCAGAAGTGGAAT
CAGTGAAAGAGCAGCTGCCAGCTAAAGCATT
GGAAACGATGGCAGAGCAGACCACTGATGTG
GTGCACTCTCCATCCACAGACACAACACCAG
GCCCAGACACAGAGGCAGCACTGGCTAAAGA
CATAGAAGAGATCACCAAGCCAGATGTGATA
TTGGCAAATGTCACGCAGCCATCTACTGAAT
CGGATATGTTCCTGGCCCAGGACATGGAACT
ACTCACAGGAACAGAGGCAGCCCACGCTAAC
AATATCATATTGCCTACAGAACCAGACGAAT
CTTCAACCAAGGATGTAGCACCACCTATGGA
AGAAGAAATTGTCCCAGGCAATGATA

(AA SEQ: ADLSLVDALTEPPPEIEGEI
KRDFMAALEAEPYDDIVGETVEKT
EFIPLLDGDEKTGNSESKKKPCLD
TSQVEGIPSSKPTLLANGDHGMEG
NNTAGSPTDFLEERVDYPDYQSS
QNWPEDASFCFQPQQVLDTDQAE
PFNEHRDDGLADLLFVSSGPTNAS
AFTERDNPSEDSYGMLPCDSFAST
AVVSQEWSVGAPNSPCSESCVSP
EVTIETLQPATELSKAAEVESVKEQ
LPAKALETMAEQTTDVVHSPSTDT
TPGPDTEAALAKDIEEITKPDVILA
NVTQPSTESDMFLAQDMELLTGTE
AAHANNIILPTEPDESSTKDVAPPM
EEEIVPGNDTTSPKETETTLPIKMD
LAPPEDVLLTKETELAPAKGMVSL
SEIEEALAKNDVRSAEIPVAQETV
VSETEVVLATEVVLPSDPITTLTK
DVTLPLEAERPLVTDMTPSLETEM
TLGKETAPPTETNLGMAKDMSPLP
ESEVTLGKDVVILPETKVAEFNNV
TPLSEEEVTSVKDMSPSAETEAPL
AKNADLHSGTELIVDNSMAPASDL
ALPLETKVATVPIKDKG)

    SEQ ID NO: 141, 142
    Reference: West (1991). J Biol Chem 266(32): 21886-96;
               Olson, K. R. (1995). J Cell Biol 130(3): 639-50.

Vesicle membrane
    Target source: Synaptobrevin
    Target domain: 5'ATGTGGGCAATCGGGATTACTGTTCT
                   GGTTATCTTCATCATCATCATCATCGTG
                   TGGGTTGTC
                   (AA SEQ: MWAIGITVLVIFIIIIIVWVV)
    SEQ ID NO: 143, 144
    Reference: Schiavo et al., (1992) Nature 359, 832-5

Vesicle membrane
    Target source: Cellubrevin
    Target domain: 5'ATGTGGGCGATAGGGATCAGTGTCCT
                   TGGTGTG
                   (AA SEQ: MWAIGISVLVIIVIIIIVWC)
    SEQ ID NO: 145, 146
    Reference: McMahon et al., Nature 364:346-349; Martin et al., J. Cell Biol. In press

Nuclear export
    Target source: MEK2
    Target domain: 5'GACCTGCAGAAGAAGCTGGAGGAGCT
                   GGAACTTGACGAG
                   (AA SEQ: DLQKKLEELELDE)
    SEQ ID NO: 147, 148
    Reference: Zheng and Guan. J. Biol. Chem. 268:11435-11439, 1993.

Peroxisome
    Target source: PX
    Target domain: 5'TCTAAACTG
                   (AA SEQ: SKL)
    SEQ ID NO: 149, 150
    Reference: Amery et al., Biochem. J. 336:367-371 (1998)



Microtubules (MAP4): SEQ ID NO:151 (Nucleic acid); SEQ ID NO:152 (Amino acid)



MAP4 : 

M A D L SLVD ALT.E PPPE IEGE 
ATGGCCGACCTC AGTCTTGTGGAT GCGTTGACAGAA CCACCTCCAGAA ATTGAGGGAGAA 
TACCGGCTGGAG TCAGAACACCTA CGCAACTGTCTT GGTGGAGGTCTT TAACTCCCTCTT 

IK.RD PMAA LEAB PYDD IVGE 
ATAAAGCGAGAC TTCATGGCTGCG CTGGAGGCAGAG CCCTATGATGAC ATCGTGGGAGAA 
TATTTCGCTCTG AAGTACCGACGC GACCTCCGTCTC GGGATACTACTG TAGCACCCTCTT 

TVER T E F I PLLD GDEK TGNS 
ACTGTGGAGAAA ACTGAGTTTATT CCTCTCCTGGAT GGTGATGAGAAA ACCGGGAACTCA 
TGACACCTCTTT TGACTCAAATAA GGAGAGGACCTA CCACTACTCTTT TGGCCCTTGAGT 

E S KK KPCL DTSQ VEGI PSSK 
GAGTC CAAAAA G AAACCCTGCTTA GACACTAGCCAG GTTGAAGGTATC CCATCTTCTAAA 
CTCAGGTTTTTC TTTGGGACGAAT CTGTGATCGGTC CAACTTCCATAG GGTAGAAGATTT 

P T L L . A. N G D H G M E G N N T • A G S P 
CCAACACTCCTA GCCAATGGTGAT CATGGAATGGAG GGGAATAACACT GCAGGGTCTCCA 
GGTTGTGAGGAT CGGTTACCACTA GTACCTTACCTC CCCTTATTGTGA CGTCCCAGAGGT 

T D PL E E R V DYPD YQSS QNWP 
ACTGACTTCCTT GAAGAGAGAGTG GACTATCCGGAT TATCAGAGCAGC CAGAACTGGCCA 
TGACTGAAGGAA CTTCTCTCTCAC CTGATAGGCCTA ATAGTCTCGTCG GTCTTGACCGGT 

EDAS FCFQ PQQV ii D T D Q A E P 
GAAGATGCAAGC TTTTGTTTCCAG CCTCAGCAAGTG TTAGATACTGAC CAGGCTG AG CCC 
CTTCTACGTTCG AAAACAAAGGTC GGAGTCGTTCAC AATCTATGACTG GTCCGACTCGGG 

FNE H RDD G LADL L F V S SG PT 
TTTAACG AG CAC CGTG ATG ATGGT TTGGCAGATCTG CTCTTTGTCTCC AGTGGACCCACG 
AAATTG CTCGTG GCACTACTACCA AACCGTCTAGAC GAGAAACAGAGG TCACCTGGGTGC 

NASA F TER DNPS E D S Y GM LP 
AACGCTTCTGCA TTTACAGAGCGA GACAATCCTTCA GAAGACAGTTAC GGTATGCTTCCC 
TTGCGAAGACGT AAATGTCTCGCT CTGTTAGGAAGT CTTCTGTCAATG CCATACGAAGGG 



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CDSF ASTA VV S Q EW SV GAPN 
TGTGACTCATTT GCTTCCACGGCT GTTGTATCTCAG GAGTGGTCTGTG GGAGCCCCAAAC 
ACACTGAGTAAA CGAAGGTGCCGA CAACATAGAGTC CTCACCAGACAC CCTCGGGGTTTG 

SP CS ESCV S PE V T. IE T LQPA 
TCTCCATGTTCA GAGTCCTGTGTC TCCCCAGAGGTT ACTATAGAAACC CTACAGCCAGCA 
AGAGGTACAAGT CTCAGGACACAG AGGGGTCTCCAA TGATATCTTTGG GATGTCGGTCGT 

T E L S KAAE VESV K B Q L P A K A 
ACAGAGCTCTCC AAGGCAGCAGAA GTGGAATCAGTG AAAGAGCAGCTG CCAGCTAAAGCA 
TGTCTCGAGAGG TTCCGTCGTCTT CACCTTAGTCAC, TTTCTCGTCGAC GGTCGATTTCGT 

LETM AEQT TDVV HS P S T DTT 
TTGGAAACGATG GCAGAGCAGACC ACTGATGTGGTG CACTCTCCATCC ACAGACACAACA 
AACCTTTGCTAC CGTCTCGTCTGG TGACTACACCAC GTGAGAGGTAGG TGTCTGTGTTGT 

PGP D TEAA LAKD IEEI TKPD 
CCAGGCCCAGAC ACAGAGGCAGCA CTGGCTAAAGAC ATAGAAGAGATC ACCAAGCCAGAT 
GGTCCGGGTCTG TGTCTCCGTCGT GACCGATTTCTG TATCTTCTCTAG TGGTTCGGTCTA 

VILA NVTQ PS TE SDM P LAQD 
GTGATATTGGCA AATGTCACGCAG CCATCTACTGAA TCGGATATGTTC CTGGCCCAGGAC 
CACTATAACCGT TTACAGTGCGTC GGTAGATGACTT AGCCTATACAAG GACCGGGTCCTG 

MELL TGTE A A H A NN II LP T E 
ATGGAACTACTC ACAGGAACAGAG GCAGCCCACGCT AACAATATCATA TTGCCTACAGAA 
TACCTTGATGAG' TGTCCTTGTCTC CGTCGGGTGCGA TTGTTATAGTAT AACGGATGTCTT 

PDES ST KD VAPP MEEE IVPG 
CCAGACGAATCT TCAACCAAGGAT GTAGCACCACCT ATGGAAGAAGAA ATTGTCCCAGGC 
GGTCTGCTTAGA AGTTGGTTCCTA CATCGTGGTGGA .TACCTTCTTCTT TAACAGGGTCCG 

ND T T SPKE T E TT LP IK MD LA 
AATGATACGACA TCCCCCAAAGAA ACAGACACAACA CTTCCAATAAAA ATGGACTTGGCA 
TTACTATGCTGT AGGGGGTTTCTT TGTCTCTGTTGT GAAGGTTATTTT TACCTGAACCGT 

PPED V LLT KETE LA PA KGM V 
CCACCTGAGGAT GTGTTACTTACC AAAGAAACAGAA CTAGCCCCAGCC AAGGGCATGGTT 
GGTGGACTCCTA CACAATGAATGG TTTCTTTGTCTT GATCGGGGTCGG TTCCCGTACCAA 

SL SE I EEA LAKN DVRS A EI P 
TCACTCTCAGAA ATAGAAGAGGCT CTGGCAAAGAAT GATGTTCGCTCT GCAGAAATACCT 
AGTGAGAGTCTT TATCTTCTCCGA GACCGXTTCTTA CTACAAGCGAGA CGTCTTTATGGA 

VAQ E TVVS ET E-V V L A T E VV L 
GTGGCTCAGGAG ACAGTGGTCTCA GAAACAGAGGTG GTCCTGGCAACA GAAGTGGTACTG 
CACCGAGTCCTC TGTCACCAGAGT CTTTGTCTCCAC CAGGACCGTTGT CTTCACCATGAC 

P S D P I T T L T K D V T L P L E A E R 
CCCTCAGATCCC ATAACAACATTG ACAAAGGATGTG ACACTCCCCTTA GAAGCAGAGAGA 
GGGAGTCTAGGG TATTGTTGTAAC TGTTTCCTACAC TGTGAGGGGAAT CTTCGTCTCTCT 
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PLV T DMTP SL E T E M T L G K B T 
CCGTTGGTGACG GACATGACTCCA TCTCTGGAAACA GAAATGACCCTA GGCAAAGAGACA 
GGCAACCACTGC CTGTACTGAGGT AGAGACCTTTGT CTTTACTGGGAT CCGTTTCTCTGT 

AP P T ETNL GMAK D MSP LPES 
GCTCCACCCACA GAAACAAATTTG GGCATGGCCAAA GACATGTCTCCA CTCCCAGAATCA 
CGAGGTGGGTGT CTTTGTTTAAAC CCGTACCGGTTT CTGTACAGAGGT GAGGGTCTTAGT 

EVTL G K DV VI LP ET KV A E F N 
GAAGTGACTCTG GGCAAGGACGTG GTTATACTTCCA GAAACAAAGGTG GCTGAGTTTAAC 
CTTCACTGAGAC CCGTTCCTGCAC CAATATGAAGGT CTTTGTTTCC AC CGACTCAAATTG 

NVT P LS'EE EVTS VK DM SPSA 
AATGTGACTCCA CTTTCAGAAGAA GAGGTAACCTCA GTCAAGGACATG TCTCCGTCTGCA 
TTACACTGAGGT GAAAGTCTTCTT CTCCATTGGAGT CAGTTCCTGTAC AGAGGCAGACGT 

ET EA PL A K NADL H SGT ELIV 
GAAACAGAGGCT CCCCTGGCTAAG AATGCTGATCTG CACTCAGGAACA GAG CTG ATTGTG 
CTTTGTCTCCGA GGGGACCGATTC TTACGACTAGAC GTGAGTCCTTGT CTCGACTAACAC 

DNSM APAS DLAL P LET .KVAT 
GACAACAGCATG GCTCCAGCCTCC GATCTTGCACTG CCCTTGGAAACA AAAGTAG CAACA 
CTGTTGTCGTAC CGAGGTCGGAGG CTAGAACGTGAC GGGAACCTTTGT TTTCATCGTTGT 

VPIK DKG'T VQ TE E. .K P R EDSQ 
GTTCCAATTAAA GACAAAGGAACT GTACAGACTGAA GAAAAACCACGT GAAGACTCCCAG 
CAAGGTTAATTT CTGTTTCCTTGA CATGTCTGACTT CTTTTTGGTGCA CTTCTGAGGGTC 

LASM QHXG QSTV PPCT ASPE 
TTAGCATCTATG CAGCACAAGGGA CAGTCAACAGTA CCTCCTTGCACG GCTTCACCAGAA 
AATCGTAGATAC GTCGTGTTCCCT GTCAGTTGTCAT GGAGGAACGTGC CGAAGTGGTCTT 

PVKA AEQ M S TL P ID AP SPLE 
CCAGTCAAAGCT GCAGAACAAATG TCTACCTTACCA ATAGATGCACCT TCTCCATTAGAG 
GGTCAGTTTCGA CGTCTTGTTTAC AGATGGAATGGT TATCTACGTGGA AGAGGTAATCTC 

NL EQ KE T P GSQP SEPC SGVS 
AACTTAGAGCAG AAGGAAACGCCT GGCAGCCAGCCT TCTGAGCCTTGC TCAGGAGTATCC . 
TTGAATCTCGTC TTCCTTTGCGGA CCGTCGGTCGGA AGACTCGGAACG AGTCCTCATAGG 

RQEE A K A A VG-V T GND I TTPP 
CGGCAAGAAGAA GCAAAGGCTGCT GTAGGTGTGACT GGAAATGACATC ACTACCCCGCCA 
GCCGTTCTTCTT CGTTTCCGACGA CATCCACACTGA CCTTTACTGTAG TGATGGGGCGGT 

N K E P P p S P E K K A K P L A T T Q P 
AACAAGGAGCCA CCACCAAGCCCA GAAAAGAAAGCA AAGCCTTTGGCC ACCACTCAACCT 
TTGTTCCTCGGT GGTGGTTCGGGT CTTTTCTTTCGT TTCGCAAACCGG TGGTGAGTTGGA 

AKTS TSKA KTQP TSL,P K Q PA 
GCAAAGACTTCA ACATCGAAAGCC AAAACACAGCCC ACTTCTCTCCCT AAGCAACCAGCT 
CGTTTCTGAAGT TGTAGCTTTCGG TTTTGTGTCGGG TGAAGAGAGGGA TTCGTTGGTCGA 
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PT TS GGLN KK PM SLAS G SVP 
CCCACCACCTCT GGTGGGTTGAAT AAAAAACCCATG AGCCTCGCCTCA GGCTCAGTGCCA 
GGGTGGTGGAGA CCACCCAACTTA TTTTTTGGGTAC TCGGAGCGGAGT CCGAGTCACGGT 

A A P H K R P A AAT A T A R P S T L P 
GCTGCCCCACAC AAACGCCCTGCT GCTGCCACTGCT ACTGCCAGGCCT TCCACCCTACCT 
CGACGGGGTGTG TTTGCGGGACGA CGACGGTGACGA TGACGGTCCGGA AGGTGGGATGGA 

A R DV KPKP I TEA KV AE KR T S 
GCCAGAGACGTG AAGCCAAAGCCA ATTACAGAAGCT AAGGTTGCCGAA AAGCGGACCTCT 
CGGTCTCTGCAC TTCGGTTTCGGT TAATGTCTTCGA TTCCAACGGCTT TTCGCCTGGAGA 

PSKP SSA. P AL K P GPKT TPTV 
CCATCCAAGCCT TCATCTGCCCCA GCCCTCAAACCT GGACCTAAAACC ACCCCAACCGTT 
GGTAGGTTCGGA AGTAGACGGGGT CGGGAGTTTGGA CCTGGATTTTGG TGGGGTTGGCAA 

SKAT SPST LVST GPSS RSPA
TCAAAAGCCACA TCTCCCTCAACT CTTGTTTCCACT GGACCAAGTAGT AGAAGTCCAGCT
AGTTTTCGGTGT AGAGGGAGTTGA GAACAAAGGTGA CCTGGTTCATCA TCTTCAGGTCGA 

TTLP K RPT SI KT EGKP ADVK 
ACAACTCTGCCT AAGAGGCCAACC AGCATCAAGACT GAGGGGAAACCT GCTGATGTCAAA 
TGTTGAGACGGA TTCTCCGGTTGG TCGTAGTTCTGA CTCCCCTTTGGA CGACTACAGTTT 

RMTA KSA S A D L S RSKT TS AS 
AGGATGACTGCT AAGTCTGCCTCA GCTGACTTGAGT CGCTCAAAGACC ACCTCTGCCAGT 
TCCTACTGACGA TTCAGACGGAGT CGACTGAACTCA GCGAGTTTCTGG TGGAGACGGTCA 

SVKR NTTP TGAA PPAG MTST 
TCTGTGAAGAGA AACACCACTCCC ACTGGGGCAGCA CCCCCAGCAGGG ATGACTTCCACT 
AGACACTTCTCT TTGTGGTGAGGG TGACCCCGTCGT GGGGGTCGTCCC TACTGAAGGTGA 

H V K P M S A P S R S S G A L S V D K K 
CGAGTCAAGCCC ATGTCTGCACCT AGCCGCTCTTCT GGGGCTCTTTCT GTGGACAAGAAG 
GCTCAGTTCGGG TACAGACGTGGA TCGGCGAGAAGA CCCCGAGAAAGA CACCTGTTCTTC 

P TST KPSS SAP R V SRL A T T V 
CCCACTTCCACT AAGCCTAGCTCC TCTGCTCCCAGG GTGAGCCGCCTG GCCACAACTGTT 
GGGTGAAGGTGA TTCGGATCGAGG AGACGAGGGTCC CACTCGGCGGAC CGGTGTTGACAA 

SAPD LKSV RSK V GSTE NIKH 
TCTGCCCCTGAC CTGAAGAGTGTT CGCTCCAAGGTC GGCTCTACAGAA AACATCAAACAC 
AGACGGGGACTG GACTTCTCACAA GCGAGGTTCCAG CCGAGATGTCTT TTGTAGTTTGTG 

QPG G G RAK VEK K T E A A TTAG 
CAGCCTGGAGGA GGCCGGGCCAAA GTAGAGAAAAAA ACAGAGGCAGCT ACCACAGCTGGG 
GTCGGACCTCCT CCGGCCCGGTTT CATCTCTTTTTT TGTCTCCGTCGA TGGTGTCGACCC 

K P E P N A V T K A A G S. I A S A Q K P 
AAGCCTGAACCT AATGCAGTCACT AAAGCAGCCGGC TCCATTGCGAGT GCACAGAAACCG 
TTCGGACTTGGA TTACGTCAGTGA TTTCGTCGGCCG AGGTAACGCTCA CGTGTCTTTGGC 

PAGK VQIV SKKV SYSH IQSK
CCTGCTGGGAAA GTCCAGATAGTA TCCAAAAAAGTG AGCTACAGTCAT ATTCAATCCAAG
GGACGACCCTTT CAGGTCTATCAT AGGTTTTTTCAC TCGATGTCAGTA TAAGTTAGGTTC

WO 00/50872 PCT/US00/04794

CVSK DNIK HVPG CGNV QIQN
TGTGTTTCCAAG GACAATATTAAG CATGTCCCTGGA TGTGGCAATGTT CAGATTCAGAAC
ACACAAAGGTTC CTGTTATAATTC GTACAGGGACCT ACACCGTTACAA GTCTAAGTCTTG

KKVD ISKV SSKC GSKA NIKH
AAGAAAGTGGAC ATATCCAAGGTC TCCTCCAAGTGT GGGTCCAAAGCT AATATCAAGCAC
TTCTTTCACCTG TATAGGTTCCAG AGGAGGTTCACA CCCAGGTTTCGA TTATAGTTCGTG

KPGG GDVK IESQ KLNF KEKA
AAGCCTGGTGGA GGAGATGTCAAG ATTGAAAGTCAG AAGTTGAACTTC AAGGAGAAGGCC
TTCGGACCACCT CCTCTACAGTTC TAACTTTCAGTC TTCAACTTGAAG TTCCTCTTCCGG

QAKV GS.LD NV GH F P AG GAV K 
CAAGCCAAAGTG GGATCCCTTGAT AACGTTGGCCAC TTTCCTGCAGGA GGTGCCGTGAAG 
GTTCGGTTTCAC CCTAG GGAACTA TTGCAACCGGTG AAAGGACGTCCT CCACGGGACTTC 

TEG G GSEA L PCP G PP A GEEP 
ACTGAGGGCGGT GGCAGTGAGGCC CTTCCGTGTCCA GGCCCCCCCGCT GGGGAGGAGCCA 
TGACTCCCGCCA CCGTCACTCCGG GAAGGCACAGGT CCGGGGGGGCGA CCCCTCCTCGGT 

VI PE A A P . D R GAP TSAS GLSG 
GTCATCCCTGAG GCTGCGCCTGAC CGTGGCGCCCCT ACTTCAGCCAGT GGCCTCAGTGGC 
CAGTAGGGACTC CGACGCGGACTG GCACCGCGGGGA TGAAGTCGGTCA CCGGAGTCACCG 

H TTL S G G G DQRE P* QTL DSQI 
CACACCACCCTG TCAGGGGGTGGT GACCAAAGGGAG CCCCAGACCTTG GACAGCCAGATC 
GTGTGGTGGGAC AGTCCCCCACCA CTGGTTTCCCTC GGGGTCTGGAAC CTGTCGGTCTAG 

QETS I* 
CAGGAGACAAGC ATCTAA 
GTCCTCTGTTCG TAGATT 
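
Because the MAP4 listing prints the complementary strand under each sense-strand row, the two rows can be checked against each other to locate transcription errors. The sketch below shows one way to do this. The helper names are illustrative, and the second example row is taken from the listing above as printed, where the two strands disagree at one position.

```python
# Watson-Crick pairing for DNA; the listing prints the lower strand
# left-to-right under the upper strand (complement, not reverse complement).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Return the base-by-base complement of a DNA string."""
    return seq.upper().replace(" ", "").translate(COMPLEMENT)


def mismatches(sense: str, antisense: str) -> list[int]:
    """1-based positions where the printed lower strand does not pair
    with the printed upper strand."""
    expected = complement(sense)
    printed = antisense.upper().replace(" ", "")
    return [i for i, (e, p) in enumerate(zip(expected, printed), start=1) if e != p]


if __name__ == "__main__":
    # First codon group of the listing: the strands agree.
    print(mismatches("ATGGCCGACCTC", "TACCGGCTGGAG"))
    # A group where the printed strands disagree at position 6.
    print(mismatches("ACAGACACAACA", "TGTCTCTGTTGT"))
```

A disagreement does not say which strand is wrong; the amino acid line printed above the pair is the natural tie-breaker.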
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Signal 



Protease Recognition Sequence
Reactant Target Sequence

B
Signal 2
Product Target Sequence
Protease Recognition Sequence
Signal 1
Reactant Target Sequence

Signal
Product Target Sequence
Protease Recognition Sequence
Reactant Target Sequence
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FIGURE 31 
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Fig 3. BHK cells transfected with DEVD-caspase biosensor. 
(A) Cells before stimulation of apoptosis. (B) Another field of 
cells after stimulation with 250 µg/ml cis-platin (4 h).






Signal: EGFP
Product Target Sequence: Nuclear Localization Signal
Protease Recognition Sequence: Caspase-3
Reactant Target Sequence: Annexin II(1-23)
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Signal 


Product Target Sequence
Protease Recognition Sequence
Reactant Target Sequence

EGFP
Nucleolar Localization Signal
Caspase-8
Annexin II(1-23)
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Signal 1 


Product Target Sequence
Protease Recognition Sequence
Signal 2
Reactant Target Sequence

EYFP
Nuclear Localization Site
Caspase-3
ECFP
Annexin II(1-23)

Fig. 50. Top: General design of biosensor with reactant
and product containing separate targeting and signal
sequences. Bottom: Specific example of this approach:
Caspase 3 biosensor with reactant targeted to cytoskeleton
and product targeted to nucleus.
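
The domain layout in Fig. 50 can be written down as a small data model. The sketch below is an interpretation rather than the disclosed implementation: it assumes the intact biosensor follows the reactant target sequence, and that after cleavage the fragment carrying the product target sequence relocates while the other fragment stays put. All class and field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    signal: str   # fluorescent protein reporting this fragment
    target: str   # compartment its targeting sequence directs it to


@dataclass(frozen=True)
class Biosensor:
    product: Fragment    # Signal 1 + Product Target Sequence
    protease: str        # protease whose recognition site links the halves
    reactant: Fragment   # Signal 2 + Reactant Target Sequence

    def observe(self, protease_active: bool) -> dict[str, str]:
        """Return the expected location of each signal.

        Assumption of this sketch: while intact, both signals sit at the
        reactant's target; once cleaved, each fragment follows its own
        targeting sequence.
        """
        if not protease_active:
            return {
                self.product.signal: self.reactant.target,
                self.reactant.signal: self.reactant.target,
            }
        return {
            self.product.signal: self.product.target,
            self.reactant.signal: self.reactant.target,
        }


# The specific example from the bottom panel of the figure.
caspase3_sensor = Biosensor(
    product=Fragment(signal="EYFP", target="nucleus"),
    protease="Caspase-3",
    reactant=Fragment(signal="ECFP", target="cytoskeleton"),
)

if __name__ == "__main__":
    print(caspase3_sensor.observe(protease_active=False))
    print(caspase3_sensor.observe(protease_active=True))
```

Under these assumptions, protease activity shows up as a change in the location of one signal relative to the other, which is the kind of readout an imaging-based screen can quantify cell by cell.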
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Bra TNF 
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IL1 + Anisomycin 



NFκB    p38

Fig. 36. Dual-labeling assay in two cell types with 3 drugs and 3 drug combinations.
Treatments marked with an asterisk are different from controls at a 99% confidence
level (p < 0.01).
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SEQUENCE LISTING 



<110> Giuliano, Kenneth A. 
Kapur, Ravi 

<120> A System for Cell Based Screening 

<130> 97-022-L 

<140> To Be Assigned 
<141> Filed Herewith 

<160> 180 

<170> PatentIn Ver. 2.0

<210> 1
<211> 1770
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)..(882)

<220>
<223> Description of Artificial Sequence:
      GFP-DEVD-Annexin II construct
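
The feature fields above carry a simple consistency check: a coding region spanning positions 1 to 882 corresponds to 882 / 3 = 294 codons, which matches the length given for the protein of SEQ ID NO:2. A minimal sketch of that arithmetic follows; the function names are illustrative, and the parser accepts the location both as normally written and with the extra spaces that appear in scanned copies.

```python
import re


def parse_location(location: str) -> tuple[int, int]:
    """Parse a feature location such as "(1)..(882)" into (start, end).
    Whitespace around the dots is tolerated."""
    match = re.fullmatch(r"\((\d+)\)\s*\.\s*\.\s*\((\d+)\)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return int(match.group(1)), int(match.group(2))


def codon_count(location: str) -> int:
    """Number of codons spanned by a CDS location (inclusive coordinates)."""
    start, end = parse_location(location)
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length // 3


if __name__ == "__main__":
    # Values taken from the <222> field of SEQ ID NO:1 and the <211>
    # field of SEQ ID NO:2 in this listing.
    print(codon_count("(1)..(882)"))
    print(codon_count("(1)..(882)") == 294)
```

The same check applies to any other CDS feature in the listing once its location and the corresponding protein length are known.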



<400> 1 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 io is 

gtc gag ctg gac Qgc gac gta aac ggc cac aag ttc age gtg tec ggc 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 



96 



gag ggc gag ggc gat gec acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 



240 



80 



cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag tcc 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

gga ctc aga tct ggc gcc ggc gct gga gcc gga gct ggc gcc gga gcc 768
Gly Leu Arg Ser Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala
245 250 255

gac gag gtg gac ggc gcc ggc gcc gat gaa gta gat ggc gcc atg tct 816
Asp Glu Val Asp Gly Ala Gly Ala Asp Glu Val Asp Gly Ala Met Ser
260 265 270

act gtc cac gaa atc ctg tgc aag ctc agc ttg gag ggt gat cat tct 864
Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser
275 280 285

aca ccc cca agt gcc tat tgaatggtga gcaagggcga ggagctgttc 912
Thr Pro Pro Ser Ala Tyr
290












accggggtgg tgcccatcct ggtcgagctg gacggcgacg taaacggcca caagttcagc 972
gtgtccggcg agggcgaggg cgatgccacc tacggcaagc tgaccctgaa gttcatctgc 1032
accaccggca agctgcccgt gccctggccc accctcgtga ccaccctgac ctacggcgtg 1092
cagtgcttca gccgctaccc cgaccacatg aagcagcacg acttcttcaa gtccgccatg 1152
cccgaaggct acgtccagga gcgcaccatc ttcttcaagg acgacggcaa ctacaagacc 1212
cgcgccgagg tgaagttcga gggcgacacc ctggtgaacc gcatcgagct gaagggcatc 1272
gacttcaagg aggacggcaa catcctgggg cacaagctgg agtacaacta caacagccac 1332
aacgtctata tcatggccga caagcagaag aacggcatca aggtgaactt caagatccgc 1392
cacaacatcg aggacggcag cgtgcagctc gccgaccact accagcagaa cacccccatc 1452
ggcgacggcc ccgtgctgct gcccgacaac cactacctga gcacccagtc cgccctgagc 1512
aaagacccca acgagaagcg cgatcacatg gtcctgctgg agttcgtgac cgccgccggg 1572
atcactctcg gcatggacga gctgtacaag tccggactca gatctggcgc cggcgctgga 1632
gccggagctg gcgccggagc cgacgaggtg gacggcgccg gcgccgatga agtagatggc 1692
gccatgtcta ctgtccacga aatcctgtgc aagctcagct tggagggtga tcattctaca 1752
cccccaagtg cctattga 1770



<210> 2 
<211> 294 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
GFP-DEVD-Annexin II construct 

<400> 2 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

Gly Leu Arg Ser Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala
245 250 255

Asp Glu Val Asp Gly Ala Gly Ala Asp Glu Val Asp Gly Ala Met Ser
260 265 270

Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser
275 280 285

Thr Pro Pro Ser Ala Tyr
290



<210> 3 
<211> 2439 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(2436)

<220>

<223> Description of Artificial Sequence:
EYFP-DEVD-MAPKDM construct



<400> 3 


atg gtg ... 48
Met Val ...
1 10 15

gtc gag ... 96
Val Glu ...
20 25 30

gag ggc ... 144
Glu Gly ...
35 40 45

tgc acc ... 192
Cys Thr ...
50 55 60

ttc ggc ... 240
Phe Gly ...
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc tac cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag aag 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys
225 230 235 240

gga gac gaa gtg gac gga gcc gac ctc agt ctt gtg gat gcg ttg aca 768
Gly Asp Glu Val Asp Gly Ala Asp Leu Ser Leu Val Asp Ala Leu Thr
245 250 255

gaa cca cct cca gaa att gag gga gaa ata aag cga gac ttc atg gct 816
Glu Pro Pro Pro Glu Ile Glu Gly Glu Ile Lys Arg Asp Phe Met Ala
260 265 270

gcg ctg gag gca gag ccc tat gat gac atc gtg gga gaa act gtg gag 864
Ala Leu Glu Ala Glu Pro Tyr Asp Asp Ile Val Gly Glu Thr Val Glu
275 280 285

aaa act gag ttt att cct ctc ctg gat ggt gat gag aaa acc ggg aac 912
Lys Thr Glu Phe Ile Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn
290 295 300

tca gag tcc aaa aag aaa ccc tgc tta gac act agc cag gtt gaa ggt 960
Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gln Val Glu Gly
305 310 315 320

atc cca tct tct aaa cca aca ctc cta gcc aat ggt gat cat gga atg 1008
Ile Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met
325 330 335

gag ggg aat aac act gca ggg tct cca act gac ttc ctt gaa gag aga 1056
Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg
340 345 350

gtg gac tat ccg gat tat cag agc agc cag aac tgg cca gaa gat gca 1104
Val Asp Tyr Pro Asp Tyr Gln Ser Ser Gln Asn Trp Pro Glu Asp Ala
355 360 365

agc ttt tgt ttc cag cct cag caa gtg tta gat act gac cag gct gag 1152
Ser Phe Cys Phe Gln Pro Gln Gln Val Leu Asp Thr Asp Gln Ala Glu
370 375 380

ccc ttt aac gag cac cgt gat gat ggt ttg gca gat ctg ctc ttt gtc 1200
Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val
385 390 395 400

tcc agt gga ccc acg aac gct tct gca ttt aca gag cga gac aat cct 1248
Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro
405 410 415

tca gaa gac agt tac ggt atg ctt ccc tgt gac tca ttt gct tcc acg 1296
Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr
420 425 430

gct gtt gta tct cag gag tgg tct gtg gga gcc cca aac tct cca tgt 1344
Ala Val Val Ser Gln Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys
435 440 445

tca gag tcc tgt gtc tcc cca gag gtt act ata gaa acc cta cag cca 1392
Ser Glu Ser Cys Val Ser Pro Glu Val Thr Ile Glu Thr Leu Gln Pro
450 455 460

gca aca gag ctc tcc aag gca gca gaa gtg gaa tca gtg aaa gag cag 1440
Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gln
465 470 475 480

ctg cca gct aaa gca ttg gaa acg atg gca gag cag acc act gat gtg 1488
Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gln Thr Thr Asp Val
485 490 495

gtg cac tct cca tcc aca gac aca aca cca ggc cca gac aca gag gca 1536
Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala
500 505 510

gca ctg gct aaa gac ata gaa gag atc acc aag cca gat gtg ata ttg 1584
Ala Leu Ala Lys Asp Ile Glu Glu Ile Thr Lys Pro Asp Val Ile Leu
515 520 525

gca aat gtc acg cag cca tct act gaa tcg gat atg ttc ctg gcc cag 1632
Ala Asn Val Thr Gln Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gln
530 535 540

gac atg gaa cta ctc aca gga aca gag gca gcc cac gct aac aat atc 1680
Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn Ile
545 550 555 560

ata ttg cct aca gaa cca gac gaa tct tca acc aag gat gta gca cca 1728
Ile Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro
565 570 575

cct atg gaa gaa gaa att gtc cca ggc aat gat acg aca tcc ccc aaa 1776
Pro Met Glu Glu Glu Ile Val Pro Gly Asn Asp Thr Thr Ser Pro Lys
580 585 590

gaa aca gag aca aca ctt cca ata aaa atg gac ttg gca cca cct gag 1824
Glu Thr Glu Thr Thr Leu Pro Ile Lys Met Asp Leu Ala Pro Pro Glu
595 600 605

gat gtg tta ctt acc aaa gaa aca gaa cta gcc cca gcc aag ggc atg 1872
Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met
610 615 620

gtt tca ctc tca gaa ata gaa gag gct ctg gca aag aat gat gtt cgc 1920
Val Ser Leu Ser Glu Ile Glu Glu Ala Leu Ala Lys Asn Asp Val Arg
625 630 635 640

tct gca gaa ata cct gtg gct cag gag aca gtg gtc tca gaa aca gag 1968
Ser Ala Glu Ile Pro Val Ala Gln Glu Thr Val Val Ser Glu Thr Glu
645 650 655

gtg gtc ctg gca aca gaa gtg gta ctg ccc tca gat ccc ata aca aca 2016
Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro Ile Thr Thr
660 665 670

ttg aca aag gat gtg aca ctc ccc tta gaa gca gag aga ccg ttg gtg 2064
Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val
675 680 685

acg gac atg act cca tct ctg gaa aca gaa atg acc cta ggc aaa gag 2112
Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu
690 695 700

aca gct cca ccc aca gaa aca aat ttg ggc atg gcc aaa gac atg tct 2160
Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser
705 710 715 720

cca ctc cca gaa tca gaa gtg act ctg ggc aag gac gtg gtt ata ctt 2208
Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val Ile Leu
725 730 735

cca gaa aca aag gtg gct gag ttt aac aat gtg act cca ctt tca gaa 2256
Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu
740 745 750

gaa gag gta acc tca gtc aag gac atg tct ccg tct gca gaa aca gag 2304
Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu
755 760 765

gct ccc ctg gct aag aat gct gat ctg cac tca gga aca gag ctg att 2352
Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu Ile
770 775 780

gtg gac aac agc atg gct cca gcc tcc gat ctt gca ctg ccc ttg gaa 2400
Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu
785 790 795 800

aca aaa gta gca aca gtt cca att aaa gac aaa gga tga 2439
Thr Lys Val Ala Thr Val Pro Ile Lys Asp Lys Gly
805 810



<210> 4 
<211> 812 
<212> PRT 
<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence:
EYFP-DEVD-MAPKDM construct

<400> 4 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys
225 230 235 240

Gly Asp Glu Val Asp Gly Ala Asp Leu Ser Leu Val Asp Ala Leu Thr
245 250 255

Glu Pro Pro Pro Glu Ile Glu Gly Glu Ile Lys Arg Asp Phe Met Ala
260 265 270

Ala Leu Glu Ala Glu Pro Tyr Asp Asp Ile Val Gly Glu Thr Val Glu
275 280 285

Lys Thr Glu Phe Ile Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn
290 295 300

Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gln Val Glu Gly
305 310 315 320

Ile Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met
325 330 335

Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg
340 345 350

Val Asp Tyr Pro Asp Tyr Gln Ser Ser Gln Asn Trp Pro Glu Asp Ala
355 360 365

Ser Phe Cys Phe Gln Pro Gln Gln Val Leu Asp Thr Asp Gln Ala Glu
370 375 380

Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val
385 390 395 400

Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro
405 410 415

Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr
420 425 430

Ala Val Val Ser Gln Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys
435 440 445

Ser Glu Ser Cys Val Ser Pro Glu Val Thr Ile Glu Thr Leu Gln Pro
450 455 460

Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gln
465 470 475 480

Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gln Thr Thr Asp Val
485 490 495

Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala
500 505 510

Ala Leu Ala Lys Asp Ile Glu Glu Ile Thr Lys Pro Asp Val Ile Leu
515 520 525

Ala Asn Val Thr Gln Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gln
530 535 540

Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn Ile
545 550 555 560

Ile Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro
565 570 575

Pro Met Glu Glu Glu Ile Val Pro Gly Asn Asp Thr Thr Ser Pro Lys
580 585 590

Glu Thr Glu Thr Thr Leu Pro Ile Lys Met Asp Leu Ala Pro Pro Glu
595 600 605

Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met
610 615 620

Val Ser Leu Ser Glu Ile Glu Glu Ala Leu Ala Lys Asn Asp Val Arg
625 630 635 640

Ser Ala Glu Ile Pro Val Ala Gln Glu Thr Val Val Ser Glu Thr Glu
645 650 655

Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro Ile Thr Thr
660 665 670

Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val
675 680 685

Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu
690 695 700

Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser
705 710 715 720

Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val Ile Leu
725 730 735

Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu
740 745 750

Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu
755 760 765

Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu Ile
770 775 780

Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu
785 790 795 800

Thr Lys Val Ala Thr Val Pro Ile Lys Asp Lys Gly
805 810



<210> 5 
<211> 2439 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(2436)

<220> 

<223> Description of Artificial Sequence: 
EYFP-DEAD-MAPKDM construct 



<400> 5 

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 240
Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc tac cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag ccc 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Pro
225 230 235 240

aga gac gaa gcc gac agc gcc gac ctc agt ctt gtg gat gcg ttg aca 768
Arg Asp Glu Ala Asp Ser Ala Asp Leu Ser Leu Val Asp Ala Leu Thr
245 250 255

gaa cca cct cca gaa att gag gga gaa ata aag cga gac ttc atg gct 816
Glu Pro Pro Pro Glu Ile Glu Gly Glu Ile Lys Arg Asp Phe Met Ala
260 265 270

gcg ctg gag gca gag ccc tat gat gac atc gtg gga gaa act gtg gag 864
Ala Leu Glu Ala Glu Pro Tyr Asp Asp Ile Val Gly Glu Thr Val Glu
275 280 285

aaa act gag ttt att cct ctc ctg gat ggt gat gag aaa acc ggg aac 912
Lys Thr Glu Phe Ile Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn
290 295 300

tca gag tcc aaa aag aaa ccc tgc tta gac act agc cag gtt gaa ggt 960
Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gln Val Glu Gly
305 310 315 320

atc cca tct tct aaa cca aca ctc cta gcc aat ggt gat cat gga atg 1008
Ile Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met
325 330 335

gag ggg aat aac act gca ggg tct cca act gac ttc ctt gaa gag aga 1056
Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg
340 345 350

gtg gac tat ccg gat tat cag agc agc cag aac tgg cca gaa gat gca 1104
Val Asp Tyr Pro Asp Tyr Gln Ser Ser Gln Asn Trp Pro Glu Asp Ala
355 360 365

agc ttt tgt ttc cag cct cag caa gtg tta gat act gac cag gct gag 1152
Ser Phe Cys Phe Gln Pro Gln Gln Val Leu Asp Thr Asp Gln Ala Glu
370 375 380

ccc ttt aac gag cac cgt gat gat ggt ttg gca gat ctg ctc ttt gtc 1200
Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val
385 390 395 400

tcc agt gga ccc acg aac gct tct gca ttt aca gag cga gac aat cct 1248
Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro
405 410 415

tca gaa gac agt tac ggt atg ctt ccc tgt gac tca ttt gct tcc acg 1296
Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr
420 425 430

gct gtt gta tct cag gag tgg tct gtg gga gcc cca aac tct cca tgt 1344
Ala Val Val Ser Gln Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys
435 440 445

tca gag tcc tgt gtc tcc cca gag gtt act ata gaa acc cta cag cca 1392
Ser Glu Ser Cys Val Ser Pro Glu Val Thr Ile Glu Thr Leu Gln Pro
450 455 460

gca aca gag ctc tcc aag gca gca gaa gtg gaa tca gtg aaa gag cag 1440
Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gln
465 470 475 480

ctg cca gct aaa gca ttg gaa acg atg gca gag cag acc act gat gtg 1488
Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gln Thr Thr Asp Val
485 490 495

gtg cac tct cca tcc aca gac aca aca cca ggc cca gac aca gag gca 1536
Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala
500 505 510

gca ctg gct aaa gac ata gaa gag atc acc aag cca gat gtg ata ttg 1584
Ala Leu Ala Lys Asp Ile Glu Glu Ile Thr Lys Pro Asp Val Ile Leu
515 520 525

gca aat gtc acg cag cca tct act gaa tcg gat atg ttc ctg gcc cag 1632
Ala Asn Val Thr Gln Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gln
530 535 540

gac atg gaa cta ctc aca gga aca gag gca gcc cac gct aac aat atc 1680
Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn Ile
545 550 555 560

ata ttg cct aca gaa cca gac gaa tct tca acc aag gat gta gca cca 1728
Ile Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro
565 570 575

cct atg gaa gaa gaa att gtc cca ggc aat gat acg aca tcc ccc aaa 1776
Pro Met Glu Glu Glu Ile Val Pro Gly Asn Asp Thr Thr Ser Pro Lys
580 585 590

gaa aca gag aca aca ctt cca ata aaa atg gac ttg gca cca cct gag 1824
Glu Thr Glu Thr Thr Leu Pro Ile Lys Met Asp Leu Ala Pro Pro Glu
595 600 605

gat gtg tta ctt acc aaa gaa aca gaa cta gcc cca gcc aag ggc atg 1872
Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met
610 615 620

gtt tca ctc tca gaa ata gaa gag gct ctg gca aag aat gat gtt cgc 1920
Val Ser Leu Ser Glu Ile Glu Glu Ala Leu Ala Lys Asn Asp Val Arg
625 630 635 640

tct gca gaa ata cct gtg gct cag gag aca gtg gtc tca gaa aca gag 1968
Ser Ala Glu Ile Pro Val Ala Gln Glu Thr Val Val Ser Glu Thr Glu
645 650 655

gtg gtc ctg gca aca gaa gtg gta ctg ccc tca gat ccc ata aca aca 2016
Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro Ile Thr Thr
660 665 670

ttg aca aag gat gtg aca ctc ccc tta gaa gca gag aga ccg ttg gtg 2064
Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val
675 680 685

acg gac atg act cca tct ctg gaa aca gaa atg acc cta ggc aaa gag 2112
Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu
690 695 700

aca gct cca ccc aca gaa aca aat ttg ggc atg gcc aaa gac atg tct 2160
Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser
705 710 715 720

cca ctc cca gaa tca gaa gtg act ctg ggc aag gac gtg gtt ata ctt 2208
Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val Ile Leu
725 730 735

cca gaa aca aag gtg gct gag ttt aac aat gtg act cca ctt tca gaa 2256
Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu
740 745 750

gaa gag gta acc tca gtc aag gac atg tct ccg tct gca gaa aca gag 2304
Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu
755 760 765

gct ccc ctg gct aag aat gct gat ctg cac tca gga aca gag ctg att 2352
Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu Ile
770 775 780

gtg gac aac agc atg gct cca gcc tcc gat ctt gca ctg ccc ttg gaa 2400
Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu
785 790 795 800

aca aaa gta gca aca gtt cca att aaa gac aaa gga tga 2439
Thr Lys Val Ala Thr Val Pro Ile Lys Asp Lys Gly
805 810



<210> 6 
<211> 812 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
EYFP-DEAD-MAPKDM construct

<400> 6 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Pro
225 230 235 240

Arg Asp Glu Ala Asp Ser Ala Asp Leu Ser Leu Val Asp Ala Leu Thr
245 250 255

Glu Pro Pro Pro Glu Ile Glu Gly Glu Ile Lys Arg Asp Phe Met Ala
260 265 270

Ala Leu Glu Ala Glu Pro Tyr Asp Asp Ile Val Gly Glu Thr Val Glu
275 280 285

Lys Thr Glu Phe Ile Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn
290 295 300

Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gln Val Glu Gly
305 310 315 320

Ile Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met
325 330 335

Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg
340 345 350

Val Asp Tyr Pro Asp Tyr Gln Ser Ser Gln Asn Trp Pro Glu Asp Ala
355 360 365

Ser Phe Cys Phe Gln Pro Gln Gln Val Leu Asp Thr Asp Gln Ala Glu
370 375 380

Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val
385 390 395 400

Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro
405 410 415

Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr
420 425 430

Ala Val Val Ser Gln Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys
435 440 445

Ser Glu Ser Cys Val Ser Pro Glu Val Thr Ile Glu Thr Leu Gln Pro
450 455 460

Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gln
465 470 475 480

Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gln Thr Thr Asp Val
485 490 495

Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala
500 505 510

Ala Leu Ala Lys Asp Ile Glu Glu Ile Thr Lys Pro Asp Val Ile Leu
515 520 525

Ala Asn Val Thr Gln Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gln
530 535 540

Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn Ile
545 550 555 560

Ile Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro
565 570 575

Pro Met Glu Glu Glu Ile Val Pro Gly Asn Asp Thr Thr Ser Pro Lys
580 585 590

Glu Thr Glu Thr Thr Leu Pro Ile Lys Met Asp Leu Ala Pro Pro Glu
595 600 605

Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met
610 615 620

Val Ser Leu Ser Glu Ile Glu Glu Ala Leu Ala Lys Asn Asp Val Arg
625 630 635 640

Ser Ala Glu Ile Pro Val Ala Gln Glu Thr Val Val Ser Glu Thr Glu
645 650 655

Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro Ile Thr Thr
660 665 670

Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val
675 680 685

Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu
690 695 700

Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser
705 710 715 720

Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val Ile Leu
725 730 735

Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu
740 745 750

Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu
755 760 765

Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu Ile
770 775 780

Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu
785 790 795 800

Thr Lys Val Ala Thr Val Pro Ile Lys Asp Lys Gly
805 810



<210> 7 
<211> 864 
<212> DNA 

<213> Artificial Sequence 




<220>

<221> CDS

<222> (1)..(861)

<220> 

<223> Description of Artificial Sequence: F25-MEK1 
construct 

<400> 7 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac acc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Thr
225 230 235 240

ggt atg ccc aag aag aag ccg acg ccc atc cag ctg aac ccg gcc ccc 768
Gly Met Pro Lys Lys Lys Pro Thr Pro Ile Gln Leu Asn Pro Ala Pro
245 250 255

gac ggc tct gca gtt aac ggg acc agc tct gcg gag acc aac ttg gag 816
Asp Gly Ser Ala Val Asn Gly Thr Ser Ser Ala Glu Thr Asn Leu Glu
260 265 270

gcc ttg cag aag aag ctg gag gag cta gag ctt gat gag cag cag tga 864
Ala Leu Gln Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu Gln Gln
275 280 285



<210> 8
<211> 287
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: F25-MEK1
construct

<400> 8

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Thr
225 230 235 240

Gly Met Pro Lys Lys Lys Pro Thr Pro Ile Gln Leu Asn Pro Ala Pro
245 250 255

Asp Gly Ser Ala Val Asn Gly Thr Ser Ser Ala Glu Thr Asn Leu Glu
260 265 270

Ala Leu Gln Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu Gln Gln
275 280 285 
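
The pairing of each coding sequence with its translation can be checked mechanically. The following is a minimal sketch, not part of the listing itself, assuming the standard genetic code; the names `translate` and `expected_protein_length` are hypothetical helpers introduced only for illustration. It translates codons 17 to 32 of SEQ ID NO: 7 and confirms that a CDS spanning (1)..(861) corresponds to the 287 residues reported for SEQ ID NO: 8.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(cds):
    """Translate a lowercase or uppercase DNA string, stopping at the first stop codon."""
    seq = "".join(cds.split()).lower()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(THREE_LETTER[aa])
    return residues


def expected_protein_length(cds_start, cds_end):
    """Number of residues encoded by a CDS feature given as (start)..(end)."""
    return (cds_end - cds_start + 1) // 3


# Codons 17 to 32 of SEQ ID NO: 7, copied from the listing.
codons_17_to_32 = "gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga"
print(" ".join(translate(codons_17_to_32)))
# Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
print(expected_protein_length(1, 861))
# 287
```

The same length check applied to the (1)..(2436) CDS features of SEQ ID NOS: 3 and 5 gives 812 residues, matching the lengths stated for SEQ ID NOS: 4 and 6.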



<210> 9 
<211> 876 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(873)

<220> 

<223> Description of Artificial Sequence: F25-MEK2 
construct 

<400> 9 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac acc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Thr
225 230 235 240

ggt atg ctg gcc cgg agg aag ccg gtg ctg ccg gcg ctc acc atc aac 768
Gly Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr Ile Asn
245 250 255

cct acc atc gcc gag ggc cca tcc cct acc agc gag ggc gcc tcc gag 816
Pro Thr Ile Ala Glu Gly Pro Ser Pro Thr Ser Glu Gly Ala Ser Glu
260 265 270

gca aac ctg gtg gac ctg cag aag aag ctg gag gag ctg gaa ctt gac 864
Ala Asn Leu Val Asp Leu Gln Lys Lys Leu Glu Glu Leu Glu Leu Asp
275 280 285

gag cag cag taa 876
Glu Gln Gln
290



<210> 10 

<211> 291 

<212> PRT 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence: F25-MEK2
construct 

<400> 10 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Thr
225 230 235 240

Gly Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr Ile Asn
245 250 255

Pro Thr Ile Ala Glu Gly Pro Ser Pro Thr Ser Glu Gly Ala Ser Glu
260 265 270

Ala Asn Leu Val Asp Leu Gln Lys Lys Leu Glu Glu Leu Glu Leu Asp
275 280 285

Glu Gln Gln
290



<210> 11
<211> 889 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1)..(888)

<220> 

<223> Description of Artificial Sequence: Caspase 
3-DEVD-substrate construct 

<400> 11 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac tcc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

gga aga agg aaa cga caa aag cga tcg gct gtt aaa tct gaa gga aag 768
Gly Arg Arg Lys Arg Gln Lys Arg Ser Ala Val Lys Ser Glu Gly Lys
245 250 255

aga aag tgt gac gaa gtt gat gga att gat gaa gta gca agt act atg 816
Arg Lys Cys Asp Glu Val Asp Gly Ile Asp Glu Val Ala Ser Thr Met
260 265 270

tct act gtc cac gaa atc ctg tgc aag ctc agc ttg gag ggt gtt cat 864
Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Val His
275 280 285

tct aca ccc cca agt acc cgg atc c 889
Ser Thr Pro Pro Ser Thr Arg Ile
290 295



<210> 12 
<211> 296 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase
3-DEVD-substrate construct

<400> 12 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

Gly Arg Arg Lys Arg Gln Lys Arg Ser Ala Val Lys Ser Glu Gly Lys
245 250 255

Arg Lys Cys Asp Glu Val Asp Gly Ile Asp Glu Val Ala Ser Thr Met
260 265 270

Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Val His
275 280 285

Ser Thr Pro Pro Ser Thr Arg Ile
290 295



<210> 13 
<211> 846 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(846)

<220> 

<223> Description of Artificial Sequence: Caspase 
6-VEID-substrate construct 

<400> 13 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac tcc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

gga aga agg aaa cga caa aag cga tcg aca aga ctt gtt gaa att gac 768
Gly Arg Arg Lys Arg Gln Lys Arg Ser Thr Arg Leu Val Glu Ile Asp
245 250 255

aac agt act atg agc aca gta cac gaa att tta tgt aaa tta agc tta 816
Asn Ser Thr Met Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu
260 265 270

gaa gga gta cac agt aca cca cca agc gca 846
Glu Gly Val His Ser Thr Pro Pro Ser Ala
275 280



<210> 14 
<211> 282 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Caspase 
6-VEID-substrate construct 

<400> 14 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

Gly Arg Arg Lys Arg Gln Lys Arg Ser Thr Arg Leu Val Glu Ile Asp
245 250 255

Asn Ser Thr Met Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu
260 265 270

Glu Gly Val His Ser Thr Pro Pro Ser Ala
275 280



<210> 15 
<211> 876 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(876)

<220> 

<223> Description of Artificial Sequence: Caspase 8-VETD 
construct 

<400> 15 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac tcc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

gga aga agc aaa cga caa aag cga tcg tat gaa aaa gga ata cca gtt 768
Gly Arg Ser Lys Arg Gln Lys Arg Ser Tyr Glu Lys Gly Ile Pro Val
245 250 255

gaa aca gac agc gaa gag caa gct tat agt act atg tct act gtc cac 816
Glu Thr Asp Ser Glu Glu Gln Ala Tyr Ser Thr Met Ser Thr Val His
260 265 270

gaa atc ctg tgc aag ctc agc ttg gag ggt gtt cat tct aca ccc cca 864
Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr Pro Pro
275 280 285

agt gcc gga tcc 876
Ser Ala Gly Ser
290



<210> 16 

<211> 292 

<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Caspase 8-VETD
construct

<400> 16 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

Gly Arg Ser Lys Arg Gln Lys Arg Ser Tyr Glu Lys Gly Ile Pro Val
245 250 255

Glu Thr Asp Ser Glu Glu Gln Ala Tyr Ser Thr Met Ser Thr Val His
260 265 270

Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr Pro Pro
275 280 285

Ser Ala Gly Ser
290



<210> 17 
<211> 906 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(906)

<220> 

<223> Description of Artificial Sequence: Cas 3-multiple 
DEVD construct 

<400> 17 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac tcc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

gga aga agg aaa cga caa aag cga tcg gca ggt gac gaa gtt gat gca 768
Gly Arg Arg Lys Arg Gln Lys Arg Ser Ala Gly Asp Glu Val Asp Ala
245 250 255

ggt gac gaa gtt gat gca ggt gac gaa gtt gat gca ggt gac gaa gtt 816
Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val
260 265 270

gac gca ggt agt act atg tct act gtc cac gaa atc ctg tgc aag ctc 864
Asp Ala Gly Ser Thr Met Ser Thr Val His Glu Ile Leu Cys Lys Leu
275 280 285

agc ttg gag ggt gtt cat tct aca ccc cca agt gcc gga tcc 906
Ser Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser
290 295 300



<210> 18 
<211> 302 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cas 3-multiple 
DEVD construct 

<400> 18 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

Gly Arg Arg Lys Arg Gln Lys Arg Ser Ala Gly Asp Glu Val Asp Ala
245 250 255

Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val
260 265 270

Asp Ala Gly Ser Thr Met Ser Thr Val His Glu Ile Leu Cys Lys Leu
275 280 285

Ser Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser
290 295 300



<210> 19 
<211> 906 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(885)

<220> 

<223> Description of Artificial Sequence: Caspase 
8-multiple VETD construct 

<400> 19 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac tcc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

gga aga agg aaa cga caa aag cga tcg gca ggt gtt gaa aca gac gca 768
Gly Arg Arg Lys Arg Gln Lys Arg Ser Ala Gly Val Glu Thr Asp Ala
245 250 255

ggt gtt gaa aca gac gca ggt gtt gaa aca gac gca ggt gtt gaa aca 816
Gly Val Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr
260 265 270

gac gca ggt agt act atg tct act gtc cac gaa atc ctg tgc aag ctc 864
Asp Ala Gly Ser Thr Met Ser Thr Val His Glu Ile Leu Cys Lys Leu
275 280 285

agc ttg gag ggt gtt cat tct acacccccaa gtgccggatc c 906
Ser Leu Glu Gly Val His Ser
290 295



<210> 20 
<211> 295 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase
8-multiple VETD construct

<400> 20 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

Gly Arg Arg Lys Arg Gln Lys Arg Ser Ala Gly Val Glu Thr Asp Ala
245 250 255

Gly Val Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr
260 265 270

Asp Ala Gly Ser Thr Met Ser Thr Val His Glu Ile Leu Cys Lys Leu
275 280 285

Ser Leu Glu Gly Val His Ser
290 295



<210> 21 
<211> 4833 
<212> DNA 

<213> Artificial Sequence 
<220>

<221> CDS

<222> (1)..(4830)

<220>

<223> Description of Artificial Sequence: 
EYFP-DEVD-MAP4-EBFP construct 

<400> 21 

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 240
Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc tac cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205
agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag aag 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys
225 230 235 240

gga gac gaa gtg gac gga atg gcc gac ctc agt ctt gtg gat gcg ttg 768
Gly Asp Glu Val Asp Gly Met Ala Asp Leu Ser Leu Val Asp Ala Leu
245 250 255

aca gaa cca cct cca gaa att gag gga gaa ata aag cga gac ttc atg 816
Thr Glu Pro Pro Pro Glu Ile Glu Gly Glu Ile Lys Arg Asp Phe Met
260 265 270

gct gcg ctg gag gca gag ccc tat gat gac atc gtg gga gaa act gtg 864
Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp Ile Val Gly Glu Thr Val
275 280 285

gag aaa act gag ttt att cct ctc ctg gat ggt gat gag aaa acc ggg 912
Glu Lys Thr Glu Phe Ile Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly
290 295 300

aac tca gag tcc aaa aag aaa ccc tgc tta gac act agc cag gtt gaa 960
Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gln Val Glu
305 310 315 320

ggt atc cca tct tct aaa cca aca ctc cta gcc aat ggt gat cat gga 1008
Gly Ile Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly
325 330 335

atg gag ggg aat aac act gca ggg tct cca act gac ttc ctt gaa gag 1056
Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu
340 345 350

aga gtg gac tat ccg gat tat cag agc agc cag aac tgg cca gaa gat 1104
Arg Val Asp Tyr Pro Asp Tyr Gln Ser Ser Gln Asn Trp Pro Glu Asp
355 360 365

gca agc ttt tgt ttc cag cct cag caa gtg tta gat act gac cag gct 1152
Ala Ser Phe Cys Phe Gln Pro Gln Gln Val Leu Asp Thr Asp Gln Ala
370 375 380

gag ccc ttt aac gag cac cgt gat gat ggt ttg gca gat ctg ctc ttt 1200
Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe
385 390 395 400

gtc tcc agt gga ccc acg aac gct tct gca ttt aca gag cga gac aat 1248
Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn
405 410 415

cct tca gaa gac agt tac ggt atg ctt ccc tgt gac tca ttt gct tcc 1296
Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser
420 425 430

acg gct gtt gta tct cag gag tgg tct gtg gga gcc cca aac tct cca 1344
Thr Ala Val Val Ser Gln Glu Trp Ser Val Gly Ala Pro Asn Ser Pro
435 440 445
tgt tca gag tcc tgt gtc tcc cca gag gtt act ata gaa acc cta cag 1392
Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr Ile Glu Thr Leu Gln
450 455 460

cca gca aca gag ctc tcc aag gca gca gaa gtg gaa tca gtg aaa gag 1440
Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu
465 470 475 480

cag ctg cca gct aaa gca ttg gaa acg atg gca gag cag acc act gat 1488
Gln Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gln Thr Thr Asp
485 490 495

gtg gtg cac tct cca tcc aca gac aca aca cca ggc cca gac aca gag 1536
Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu
500 505 510

gca gca ctg gct aaa gac ata gaa gag atc acc aag cca gat gtg ata 1584
Ala Ala Leu Ala Lys Asp Ile Glu Glu Ile Thr Lys Pro Asp Val Ile
515 520 525

ttg gca aat gtc acg cag cca tct act gaa tcg gat atg ttc ctg gcc 1632
Leu Ala Asn Val Thr Gln Pro Ser Thr Glu Ser Asp Met Phe Leu Ala
530 535 540

cag gac atg gaa cta ctc aca gga aca gag gca gcc cac gct aac aat 1680
Gln Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn
545 550 555 560

atc ata ttg cct aca gaa cca gac gaa tct tca acc aag gat gta gca 1728
Ile Ile Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala
565 570 575

cca cct atg gaa gaa gaa att gtc cca ggc aat gat acg aca tcc ccc 1776
Pro Pro Met Glu Glu Glu Ile Val Pro Gly Asn Asp Thr Thr Ser Pro
580 585 590

aaa gaa aca gag aca aca ctt cca ata aaa atg gac ttg gca cca cct 1824
Lys Glu Thr Glu Thr Thr Leu Pro Ile Lys Met Asp Leu Ala Pro Pro
595 600 605

gag gat gtg tta ctt acc aaa gaa aca gaa cta gcc cca gcc aag ggc 1872
Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly
610 615 620

atg gtt tca ctc tca gaa ata gaa gag gct ctg gca aag aat gat gtt 1920
Met Val Ser Leu Ser Glu Ile Glu Glu Ala Leu Ala Lys Asn Asp Val
625 630 635 640

cgc tct gca gaa ata cct gtg gct cag gag aca gtg gtc tca gaa aca 1968
Arg Ser Ala Glu Ile Pro Val Ala Gln Glu Thr Val Val Ser Glu Thr
645 650 655

gag gtg gtc ctg gca aca gaa gtg gta ctg ccc tca gat ccc ata aca 2016
Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro Ile Thr
660 665 670

aca ttg aca aag gat gtg aca ctc ccc tta gaa gca gag aga ccg ttg 2064
Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu
675 680 685

gtg acg gac atg act cca tct ctg gaa aca gaa atg acc cta ggc aaa 2112
Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys
690 695 700

gag aca gct cca ccc aca gaa aca aat ttg ggc atg gcc aaa gac atg 2160
Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met
705 710 715 720

tct cca ctc cca gaa tca gaa gtg act ctg ggc aag gac gtg gtt ata 2208
Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val Ile
725 730 735

ctt cca gaa aca aag gtg gct gag ttt aac aat gtg act cca ctt tca 2256
Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser
740 745 750

gaa gaa gag gta acc tca gtc aag gac atg tct ccg tct gca gaa aca 2304
Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr
755 760 765

gag gct ccc ctg gct aag aat gct gat ctg cac tca gga aca gag ctg 2352
Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu
770 775 780



att gtg gac aac agc atg gct cca gcc tcc gat ctt gca ctg ccc ttg 2400
Ile Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu
785 790 795 800

gaa aca aaa gta gca aca gtt cca att aaa gac aaa gga act gta cag 2448
Glu Thr Lys Val Ala Thr Val Pro Ile Lys Asp Lys Gly Thr Val Gln
805 810 815

act gaa gaa aaa cca cgt gaa gac tcc cag tta gca tct atg cag cac 2496
Thr Glu Glu Lys Pro Arg Glu Asp Ser Gln Leu Ala Ser Met Gln His
820 825 830

aag gga cag tca aca gta cct cct tgc acg gct tca cca gaa cca gtc 2544
Lys Gly Gln Ser Thr Val Pro Pro Cys Thr Ala Ser Pro Glu Pro Val
835 840 845

aaa gct gca gaa caa atg tct acc tta cca ata gat gca cct tct cca 2592
Lys Ala Ala Glu Gln Met Ser Thr Leu Pro Ile Asp Ala Pro Ser Pro
850 855 860

tta gag aac tta gag cag aag gaa acg cct ggc agc cag cct tct gag 2640
Leu Glu Asn Leu Glu Gln Lys Glu Thr Pro Gly Ser Gln Pro Ser Glu
865 870 875 880

cct tgc tca gga gta tcc cgg caa gaa gaa gca aag gct gct gta ggt 2688
Pro Cys Ser Gly Val Ser Arg Gln Glu Glu Ala Lys Ala Ala Val Gly
885 890 895

gtg act gga aat gac atc act acc ccg cca aac aag gag cca cca cca 2736
Val Thr Gly Asn Asp Ile Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro
900 905 910

agc cca gaa aag aaa gca aag cct ttg gcc acc act caa cct gca aag 2784
Ser Pro Glu Lys Lys Ala Lys Pro Leu Ala Thr Thr Gln Pro Ala Lys
915 920 925

act tca aca tcg aaa gcc aaa aca cag ccc act tct ctc cct aag caa 2832
Thr Ser Thr Ser Lys Ala Lys Thr Gln Pro Thr Ser Leu Pro Lys Gln
930 935 940

cca gct ccc acc acc tct ggt ggg ttg aat aaa aaa ccc atg agc ctc 2880
Pro Ala Pro Thr Thr Ser Gly Gly Leu Asn Lys Lys Pro Met Ser Leu
945 950 955 960



gcc tca ggc tca gtg cca gct gcc cca cac aaa cgc cct gct gct gcc 2928
Ala Ser Gly Ser Val Pro Ala Ala Pro His Lys Arg Pro Ala Ala Ala
965 970 975

act gct act gcc agg cct tcc acc cta cct gcc aga gac gtg aag cca 2976
Thr Ala Thr Ala Arg Pro Ser Thr Leu Pro Ala Arg Asp Val Lys Pro
980 985 990

aag cca att aca gaa gct aag gtt gcc gaa aag cgg acc tct cca tcc 3024
Lys Pro Ile Thr Glu Ala Lys Val Ala Glu Lys Arg Thr Ser Pro Ser
995 1000 1005

aag cct tca tct gcc cca gcc ctc aaa cct gga cct aaa acc acc cca 3072
Lys Pro Ser Ser Ala Pro Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro
1010 1015 1020

acc gtt tca aaa gcc aca tct ccc tca act ctt gtt tcc act gga cca 3120
Thr Val Ser Lys Ala Thr Ser Pro Ser Thr Leu Val Ser Thr Gly Pro
1025 1030 1035 1040

agt agt aga agt cca gct aca act ctg cct aag agg cca acc agc atc 3168
Ser Ser Arg Ser Pro Ala Thr Thr Leu Pro Lys Arg Pro Thr Ser Ile
1045 1050 1055

aag act gag ggg aaa cct gct gat gtc aaa agg atg act gct aag tct 3216
Lys Thr Glu Gly Lys Pro Ala Asp Val Lys Arg Met Thr Ala Lys Ser
1060 1065 1070

gcc tca gct gac ttg agt cgc tca aag acc acc tct gcc agt tct gtg 3264
Ala Ser Ala Asp Leu Ser Arg Ser Lys Thr Thr Ser Ala Ser Ser Val
1075 1080 1085

aag aga aac acc act ccc act ggg gca gca ccc cca gca ggg atg act 3312
Lys Arg Asn Thr Thr Pro Thr Gly Ala Ala Pro Pro Ala Gly Met Thr
1090 1095 1100

tcc act cga gtc aag ccc atg tct gca cct agc cgc tct tct ggg gct 3360
Ser Thr Arg Val Lys Pro Met Ser Ala Pro Ser Arg Ser Ser Gly Ala
1105 1110 1115 1120

ctt tct gtg gac aag aag ccc act tcc act aag cct agc tcc tct gct 3408
Leu Ser Val Asp Lys Lys Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala
1125 1130 1135

ccc agg gtg agc cgc ctg gcc aca act gtt tct gcc cct gac ctg aag 3456
Pro Arg Val Ser Arg Leu Ala Thr Thr Val Ser Ala Pro Asp Leu Lys
1140 1145 1150

agt gtt cgc tcc aag gtc ggc tct aca gaa aac atc aaa cac cag cct 3504
Ser Val Arg Ser Lys Val Gly Ser Thr Glu Asn Ile Lys His Gln Pro
1155 1160 1165

gga gga ggc cgg gcc aaa gta gag aaa aaa aca gag gca gct acc aca 3552
Gly Gly Gly Arg Ala Lys Val Glu Lys Lys Thr Glu Ala Ala Thr Thr
1170 1175 1180

gct ggg aag cct gaa cct aat gca gtc act aaa gca gcc ggc tcc att 3600
Ala Gly Lys Pro Glu Pro Asn Ala Val Thr Lys Ala Ala Gly Ser Ile
1185 1190 1195 1200

gcg agt gca cag aaa ccg cct gct ggg aaa gtc cag ata gta tcc aaa 3648
Ala Ser Ala Gln Lys Pro Pro Ala Gly Lys Val Gln Ile Val Ser Lys
1205 1210 1215

aaa gtg agc tac agt cat att caa tcc aag tgt gtt tcc aag gac aat 3696
Lys Val Ser Tyr Ser His Ile Gln Ser Lys Cys Val Ser Lys Asp Asn
1220 1225 1230

att aag cat gtc cct gga tgt ggc aat gtt cag att cag aac aag aaa 3744
Ile Lys His Val Pro Gly Cys Gly Asn Val Gln Ile Gln Asn Lys Lys
1235 1240 1245

gtg gac ata tcc aag gtc tcc tcc aag tgt ggg tcc aaa gct aat atc 3792
Val Asp Ile Ser Lys Val Ser Ser Lys Cys Gly Ser Lys Ala Asn Ile
1250 1255 1260

aag cac aag cct ggt gga gga gat gtc aag att gaa agt cag aag ttg 3840
Lys His Lys Pro Gly Gly Gly Asp Val Lys Ile Glu Ser Gln Lys Leu
1265 1270 1275 1280

aac ttc aag gag aag gcc caa gcc aaa gtg gga tcc ctt gat aac gtt 3888
Asn Phe Lys Glu Lys Ala Gln Ala Lys Val Gly Ser Leu Asp Asn Val
1285 1290 1295

ggc cac ttt cct gca gga ggt gcc gtg aag act gag ggc ggt ggc agt 3936
Gly His Phe Pro Ala Gly Gly Ala Val Lys Thr Glu Gly Gly Gly Ser
1300 1305 1310

gag gcc ctt ccg tgt cca ggc ccc ccc gct ggg gag gag cca gtc atc 3984
Glu Ala Leu Pro Cys Pro Gly Pro Pro Ala Gly Glu Glu Pro Val Ile
1315 1320 1325

cct gag gct gcg cct gac cgt ggc gcc cct act tca gcc agt ggc ctc 4032
Pro Glu Ala Ala Pro Asp Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu
1330 1335 1340

agt ggc cac acc acc ctg tca ggg ggt ggt gac caa agg gag ccc cag 4080
Ser Gly His Thr Thr Leu Ser Gly Gly Gly Asp Gln Arg Glu Pro Gln
1345 1350 1355 1360

acc ttg gac agc cag atc cag gag aca agc atc atg gtg agc aag ggc 4128
Thr Leu Asp Ser Gln Ile Gln Glu Thr Ser Ile Met Val Ser Lys Gly
1365 1370 1375

gag gag ctg ttc acc ggg gtg gtg ccc atc ctg gtc gag ctg gac ggc 4176
Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly
1380 1385 1390

gac gta aac ggc cac aag ttc agc gtg tcc ggc gag ggc gag ggc gat 4224
Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp
1395 1400 1405

gcc acc tac ggc aag ctg acc ctg aag ttc atc tgc acc acc ggc aag 4272
Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys
1410 1415 1420
ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc ctg acc cac ggc gtg 4320
Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr His Gly Val
1425 1430 1435 1440

cag tgc ttc agc cgc tac ccc gac cac atg aag cag cac gac ttc ttc 4368
Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe
1445 1450 1455

aag tcc gcc atg ccc gaa ggc tac gtc cag gag cgc acc atc ttc ttc 4416
Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe
1460 1465 1470

aag gac gac ggc aac tac aag acc cgc gcc gag gtg aag ttc gag ggc 4464
Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly
1475 1480 1485

gac acc ctg gtg aac cgc atc gag ctg aag ggc atc gac ttc aag gag 4512
Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu
1490 1495 1500

gac ggc aac atc ctg ggg cac aag ctg gag tac aac ttc aac agc cac 4560
Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His
1505 1510 1515 1520

aac gtc tat atc atg gcc gac aag cag aag aac ggc atc aag gtg aac 4608
Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn
1525 1530 1535

ttc aag atc cgc cac aac atc gag gac ggc agc gtg cag ctc gcc gac 4656
Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp
1540 1545 1550

cac tac cag cag aac acc ccc atc ggc gac ggc ccc gtg ctg ctg ccc 4704
His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro
1555 1560 1565

gac aac cac tac ctg agc acc cag tcc gcc ctg agc aaa gac ccc aac 4752
Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn
1570 1575 1580

gag aag cgc gat cac atg gtc ctg ctg gag ttc gtg acc gcc gcc ggg 4800
Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly
1585 1590 1595 1600

atc act ctc ggc atg gac gag ctg tac aag tag 4833
Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
1605 1610



<210> 22 
<211> 1610 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
EYFP-DEVD-MAP4-EBFP construct 

<400> 22 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys
225 230 235 240

Gly Asp Glu Val Asp Gly Met Ala Asp Leu Ser Leu Val Asp Ala Leu
245 250 255

Thr Glu Pro Pro Pro Glu Ile Glu Gly Glu Ile Lys Arg Asp Phe Met
260 265 270

Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp Ile Val Gly Glu Thr Val
275 280 285

Glu Lys Thr Glu Phe Ile Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly
290 295 300

Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gln Val Glu
305 310 315 320

Gly Ile Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly
325 330 335

Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu
340 345 350

Arg Val Asp Tyr Pro Asp Tyr Gln Ser Ser Gln Asn Trp Pro Glu Asp
355 360 365

Ala Ser Phe Cys Phe Gln Pro Gln Gln Val Leu Asp Thr Asp Gln Ala
370 375 380

Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe
385 390 395 400

Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn 
405 410 415 

Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser 
420 425 430 

Thr Ala Val Val Ser Gln Glu Trp Ser Val Gly Ala Pro Asn Ser Pro
435 440 445

Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr Ile Glu Thr Leu Gln
450 455 460

Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu
465 470 475 480

Gln Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gln Thr Thr Asp
485 490 495

Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu
500 505 510

Ala Ala Leu Ala Lys Asp Ile Glu Glu Ile Thr Lys Pro Asp Val Ile
515 520 525

Leu Ala Asn Val Thr Gln Pro Ser Thr Glu Ser Asp Met Phe Leu Ala
530 535 540

Gln Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn
545 550 555 560

Ile Ile Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala
565 570 575

Pro Pro Met Glu Glu Glu Ile Val Pro Gly Asn Asp Thr Thr Ser Pro
580 585 590

Lys Glu Thr Glu Thr Thr Leu Pro Ile Lys Met Asp Leu Ala Pro Pro
595 600 605

Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly
610 615 620

Met Val Ser Leu Ser Glu Ile Glu Glu Ala Leu Ala Lys Asn Asp Val
625 630 635 640

Arg Ser Ala Glu Ile Pro Val Ala Gln Glu Thr Val Val Ser Glu Thr
645 650 655

Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro Ile Thr
660 665 670

Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu
675 680 685

Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys
690 695 700

Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met
705 710 715 720

Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val Ile
725 730 735

Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser
740 745 750

Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr
755 760 765

Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu
770 775 780

Ile Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu
785 790 795 800

Glu Thr Lys Val Ala Thr Val Pro Ile Lys Asp Lys Gly Thr Val Gln
805 810 815

Thr Glu Glu Lys Pro Arg Glu Asp Ser Gln Leu Ala Ser Met Gln His
820 825 830

Lys Gly Gln Ser Thr Val Pro Pro Cys Thr Ala Ser Pro Glu Pro Val
835 840 845

Lys Ala Ala Glu Gln Met Ser Thr Leu Pro Ile Asp Ala Pro Ser Pro
850 855 860

Leu Glu Asn Leu Glu Gln Lys Glu Thr Pro Gly Ser Gln Pro Ser Glu
865 870 875 880

Pro Cys Ser Gly Val Ser Arg Gln Glu Glu Ala Lys Ala Ala Val Gly
885 890 895

Val Thr Gly Asn Asp Ile Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro
900 905 910

Ser Pro Glu Lys Lys Ala Lys Pro Leu Ala Thr Thr Gln Pro Ala Lys
915 920 925

Thr Ser Thr Ser Lys Ala Lys Thr Gln Pro Thr Ser Leu Pro Lys Gln
930 935 940

Pro Ala Pro Thr Thr Ser Gly Gly Leu Asn Lys Lys Pro Met Ser Leu
945 950 955 960

Ala Ser Gly Ser Val Pro Ala Ala Pro His Lys Arg Pro Ala Ala Ala
965 970 975

Thr Ala Thr Ala Arg Pro Ser Thr Leu Pro Ala Arg Asp Val Lys Pro
980 985 990

Lys Pro Ile Thr Glu Ala Lys Val Ala Glu Lys Arg Thr Ser Pro Ser
995 1000 1005

Lys Pro Ser Ser Ala Pro Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro
1010 1015 1020

Thr Val Ser Lys Ala Thr Ser Pro Ser Thr Leu Val Ser Thr Gly Pro
1025 1030 1035 1040

Ser Ser Arg Ser Pro Ala Thr Thr Leu Pro Lys Arg Pro Thr Ser Ile
1045 1050 1055

Lys Thr Glu Gly Lys Pro Ala Asp Val Lys Arg Met Thr Ala Lys Ser
1060 1065 1070

Ala Ser Ala Asp Leu Ser Arg Ser Lys Thr Thr Ser Ala Ser Ser Val
1075 1080 1085

Lys Arg Asn Thr Thr Pro Thr Gly Ala Ala Pro Pro Ala Gly Met Thr
1090 1095 1100

Ser Thr Arg Val Lys Pro Met Ser Ala Pro Ser Arg Ser Ser Gly Ala
1105 1110 1115 1120

Leu Ser Val Asp Lys Lys Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala
1125 1130 1135

Pro Arg Val Ser Arg Leu Ala Thr Thr Val Ser Ala Pro Asp Leu Lys
1140 1145 1150

Ser Val Arg Ser Lys Val Gly Ser Thr Glu Asn Ile Lys His Gln Pro
1155 1160 1165

Gly Gly Gly Arg Ala Lys Val Glu Lys Lys Thr Glu Ala Ala Thr Thr 
1170 1175 H80 

Ala Gly Lys Pro Glu Pro Asn Ala Val Thr Lys Ala Ala Gly Ser Ile
1185 1190 1195 1200

Ala Ser Ala Gln Lys Pro Pro Ala Gly Lys Val Gln Ile Val Ser Lys
1205 1210 1215

Lys Val Ser Tyr Ser His Ile Gln Ser Lys Cys Val Ser Lys Asp Asn
1220 1225 1230

Ile Lys His Val Pro Gly Cys Gly Asn Val Gln Ile Gln Asn Lys Lys
1235 1240 1245

Val Asp Ile Ser Lys Val Ser Ser Lys Cys Gly Ser Lys Ala Asn Ile
1250 1255 1260

Lys His Lys Pro Gly Gly Gly Asp Val Lys Ile Glu Ser Gln Lys Leu
1265 1270 1275 1280

Asn Phe Lys Glu Lys Ala Gln Ala Lys Val Gly Ser Leu Asp Asn Val
1285 1290 1295

Gly His Phe Pro Ala Gly Gly Ala Val Lys Thr Glu Gly Gly Gly Ser
1300 1305 1310

Glu Ala Leu Pro Cys Pro Gly Pro Pro Ala Gly Glu Glu Pro Val Ile
1315 1320 1325

Pro Glu Ala Ala Pro Asp Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu
1330 1335 1340

Ser Gly His Thr Thr Leu Ser Gly Gly Gly Asp Gln Arg Glu Pro Gln
1345 1350 1355 1360

Thr Leu Asp Ser Gln Ile Gln Glu Thr Ser Ile Met Val Ser Lys Gly
1365 1370 1375

Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly
1380 1385 1390

Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp
1395 1400 1405

Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys
1410 1415 1420

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr His Gly Val
1425 1430 1435 1440

Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe
1445 1450 1455

Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe
1460 1465 1470

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly
1475 1480 1485

Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu
1490 1495 1500

Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His
1505 1510 1515 1520

Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn
1525 1530 1535

Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp
1540 1545 1550

His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro
1555 1560 1565

Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn
1570 1575 1580

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly
1585 1590 1595 1600

Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
1605 1610



<210> 23 
<211> 978 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(978)

<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-Caspase 8-annexin II construct

<400> 23 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205



tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac tcc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

gga aga aaa cgt ata cgt act tac ctc aag tcc tgc agg cgg atg aaa 768
Gly Arg Lys Arg Ile Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys
245 250 255

aga agt ggt ttt gag atg tct cga cct att cct tcc cac ctt act cga 816
Arg Ser Gly Phe Glu Met Ser Arg Pro Ile Pro Ser His Leu Thr Arg
260 265 270

tcg gca ggt gtt gaa aca gac gca ggt gtt gaa aca gac gca ggt gtt 864
Ser Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Val 
275 280 285 

gaa aca gac gca ggt gtt gaa aca gac gca ggt agt act atg tct act 912 
Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Ser Thr Met Ser Thr 
290 295 300 



gtc cac gaa atc ctg tgc aag ctc agc ttg gag ggt gtt cat tct aca 960
Val His Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr
305 310 315 320

ccc cca agt gcc gga tcc 978
Pro Pro Ser Ala Gly Ser
325



<210> 24 
<211> 326 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-Caspase 8-annexin II construct 

<400> 24 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

Gly Arg Lys Arg Ile Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys
245 250 255

Arg Ser Gly Phe Glu Met Ser Arg Pro Ile Pro Ser His Leu Thr Arg
260 265 270

Ser Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Val
275 280 285

Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Ser Thr Met Ser Thr
290 295 300

Val His Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr
305 310 315 320

Pro Pro Ser Ala Gly Ser 
325 



<210> 25 
<211> 948 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(948)

<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-Caspase 3-annexin II construct
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<400> 25 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gta aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac tcc 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

gga aga aaa cgt ata cgt act tac ctc aag tcc tgc agg cgg atg aaa 768
Gly Arg Lys Arg Ile Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys
245 250 255

aga agt ggt ttt gag atg tct cga cct att cct tcc cac ctt act cga 816
Arg Ser Gly Phe Glu Met Ser Arg Pro Ile Pro Ser His Leu Thr Arg
260 265 270

tcg tat gaa aaa gga ata cca gtt gaa aca gac agc gaa gag caa gct 864
Ser Tyr Glu Lys Gly Ile Pro Val Glu Thr Asp Ser Glu Glu Gln Ala
275 280 285

tat agt act atg tct act gtc cac gaa atc ctg tgc aag ctc agc ttg 912
Tyr Ser Thr Met Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu
290 295 300

gag ggt gtt cat tct aca ccc cca agt gcc gga tcc 948
Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser
305 310 315

<210> 26 
<211> 316 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-Caspase 3-annexin II construct 

<400> 26 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn Ser
225 230 235 240

Gly Arg Lys Arg Ile Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys
245 250 255

Arg Ser Gly Phe Glu Met Ser Arg Pro Ile Pro Ser His Leu Thr Arg
260 265 270

Ser Tyr Glu Lys Gly Ile Pro Val Glu Thr Asp Ser Glu Glu Gln Ala
275 280 285

Tyr Ser Thr Met Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu
290 295 300

Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser
305 310 315



<210> 27 
<211> 2088 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(1041)

<220> 

<223> Description of Artificial Sequence:
NLS-Fred25-synaptobrevin construct

<400> 27 

atg aga aga aaa cga caa aag gct agc aaa gga gaa gaa ctc ttc act 48
Met Arg Arg Lys Arg Gln Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr
1 5 10 15

gga gtt gtc cca att ctt gtt gaa tta gat ggt gat gtt aac ggc cac 96
Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His
20 25 30

aag ttc tct gtc agt gga gag ggt gaa ggt gat gca aca tac gga aaa 144
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys
35 40 45

ctt acc ctg aag ttc atc tgc act act ggc aaa ctg cct gtt cca tgg 192
Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp
50 55 60



cca aca cta gtc act act ctg tgc tat ggt gtt caa tgc ttt tca aga 240
Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg
65 70 75 80

tac ccg gat cat atg aaa cgg cat gac ttt ttc aag agt gcc atg ccc 288
Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro
85 90 95

gaa ggt tat gta cag gaa agg acc atc ttc ttc aaa gat gac ggc aac 336
Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn
100 105 110

tac aag aca cgt gct gaa gtc aag ttt gaa ggt gat acc ctt gtt aat 384
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn
115 120 125

aga atc gag tta aaa ggt att gac ttc aag gaa gat ggc aac att ctg 432
Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu
130 135 140

gga cac aaa ttg gaa tac aac tat aac tca cac aat gta tac atc atg 480
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met
145 150 155 160



gca gac aaa caa aag aat gga ate aaa gtg aac ttc aag acc cgc cac 528 
Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys Thr Arg His 
165 170 175 

aac att gaa gat gga agc gtt caa cta gca gac cat tat caa caa aat 576
Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn
180 185 190

act cca att ggc gat ggc cct gtc ctt tta cca gac aac cat tac ctg 624
Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu
195 200 205

tcc aca caa tct gcc ctt tcg aaa gat ccc aac gaa aag aga gac cac 672
Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His
210 215 220

atg gtc ctt ctt gag ttt gta aca gct gct ggg att aca cat ggc atg 720
Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met
225 230 235 240

gat gaa ctg tac aac acc ggt atg tct aca ggt cca act gct gcc act 768
Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Pro Thr Ala Ala Thr
245 250 255

ggc agt aat cga aga ctt cag cag aca caa aat caa gta gat gag gtg 816
Gly Ser Asn Arg Arg Leu Gln Gln Thr Gln Asn Gln Val Asp Glu Val
260 265 270

gtg gac ata atg cga gtt aac gtg gac aag gtt ctg gaa aga gac cag 864
Val Asp Ile Met Arg Val Asn Val Asp Lys Val Leu Glu Arg Asp Gln
275 280 285

aag ctc tct gag tta gac gac cgt gca gac gca ctg cag gca ggc gct 912
Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu Gln Ala Gly Ala
290 295 300

tct caa ttt gaa acg agc gca gcc aag ttg aag agg aaa tat tgg tgg 960
Ser Gln Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg Lys Tyr Trp Trp
305 310 315 320

aag aat tgc aag atg tgg gca atc ggg att act gtt ctg gtt atc ttc 1008
Lys Asn Cys Lys Met Trp Ala Ile Gly Ile Thr Val Leu Val Ile Phe
325 330 335

atc atc atc atc atc gtg tgg gtt gtc tct tca tgaatgagaa gaaaacgaca 1061
Ile Ile Ile Ile Ile Val Trp Val Val Ser Ser
340 345

aaaggctagc aaaggagaag aactcttcac tggagttgtc ccaattcttg ttgaattaga 1121
tggtgatgtt aacggccaca agttctctgt cagtggagag ggtgaaggtg atgcaacata 1181
cggaaaactt accctgaagt tcatctgcac tactggcaaa ctgcctgttc catggccaac 1241
actagtcact actctgtgct atggtgttca atgcttttca agatacccgg atcatatgaa 1301
acggcatgac tttttcaaga gtgccatgcc cgaaggttat gtacaggaaa ggaccatctt 1361
cttcaaagat gacggcaact acaagacacg tgctgaagtc aagtttgaag gtgataccct 1421
tgttaataga atcgagttaa aaggtattga cttcaaggaa gatggcaaca ttctgggaca 1481
caaattggaa tacaactata actcacacaa tgtatacatc atggcagaca aacaaaagaa 1541
tggaatcaaa gtgaacttca agacccgcca caacattgaa gatggaagcg ttcaactagc 1601
agaccattat caacaaaata ctccaattgg cgatggccct gtccttttac cagacaacca 1661
ttacctgtcc acacaatctg ccctttcgaa agatcccaac gaaaagagag accacatggt 1721
ccttcttgag tttgtaacag ctgctgggat tacacatggc atggatgaac tgtacaacac 1781
cggtatgtct acaggtccaa ctgctgccac tggcagtaat cgaagacttc agcagacaca 1841
aaatcaagta gatgaggtgg tggacataat gcgagttaac gtggacaagg ttctggaaag 1901
agaccagaag ctctctgagt tagacgaccg tgcagacgca ctgcaggcag gcgcttctca 1961
atttgaaacg agcgcagcca agttgaagag gaaatattgg tggaagaatt gcaagatgtg 2021
ggcaatcggg attactgttc tggttatctt catcatcatc atcatcgtgt gggttgtctc 2081
ttcatga 2088

<210> 28
<211> 347
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence:
NLS-Fred25-synaptobrevin construct

<400> 28

Met Arg Arg Lys Arg Gln Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr
1 5 10 15

Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His
20 25 30

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys
35 40 45

Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp
50 55 60

Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg
65 70 75 80

Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro
85 90 95

Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn
100 105 110

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn
115 120 125

Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu
130 135 140

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met
145 150 155 160

Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Thr Arg His
165 170 175

Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn
180 185 190

Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu
195 200 205

Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His
210 215 220

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met
225 230 235 240

Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Pro Thr Ala Ala Thr
245 250 255

Gly Ser Asn Arg Arg Leu Gln Gln Thr Gln Asn Gln Val Asp Glu Val
260 265 270

Val Asp Ile Met Arg Val Asn Val Asp Lys Val Leu Glu Arg Asp Gln
275 280 285

Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu Gln Ala Gly Ala
290 295 300

Ser Gln Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg Lys Tyr Trp Trp
305 310 315 320

Lys Asn Cys Lys Met Trp Ala Ile Gly Ile Thr Val Leu Val Ile Phe
325 330 335

Ile Ile Ile Ile Ile Val Trp Val Val Ser Ser
340 345

<210> 29
<211> 2106
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)..(1050)

<220>
<223> Description of Artificial Sequence:
NLS-Fred25-cellubrevin construct

<400> 29

atg aga aga aaa cga caa aag gct agc aaa gga gaa gaa ctc ttc act 48
Met Arg Arg Lys Arg Gln Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr
1 5 10 15

gga gtt gtc cca att ctt gtt gaa tta gat ggt gat gtt aac ggc cac 96
Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His
20 25 30

aag ttc tct gtc agt gga gag ggt gaa ggt gat gca aca tac gga aaa 144
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys
35 40 45

ctt acc ctg aag ttc atc tgc act act ggc aaa ctg cct gtt cca tgg 192
Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp
50 55 60

cca aca cta gtc act act ctg tgc tat ggt gtt caa tgc ttt tca aga 240
Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg
65 70 75 80

tac ccg gat cat atg aaa cgg cat gac ttt ttc aag agt gcc atg ccc 288
Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro
85 90 95

gaa ggt tat gta cag gaa agg acc atc ttc ttc aaa gat gac ggc aac 336
Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn
100 105 110

tac aag aca cgt gct gaa gtc aag ttt gaa ggt gat acc ctt gtt aat 384
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn
115 120 125

aga atc gag tta aaa ggt att gac ttc aag gaa gat ggc aac att ctg 432
Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu
130 135 140

gga cac aaa ttg gaa tac aac tat aac tca cac aat gta tac atc atg 480
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met
145 150 155 160

gca gac aaa caa aag aat gga atc aaa gtg aac ttc aag acc cgc cac 528
Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Thr Arg His
165 170 175

aac att gaa gat gga agc gtt caa cta gca gac cat tat caa caa aat 576
Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn
180 185 190

act cca att ggc gat ggc cct gtc ctt tta cca gac aac cat tac ctg 624
Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu
195 200 205

tcc aca caa tct gcc ctt tcg aaa gat ccc aac gaa aag aga gac cac 672
Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His
210 215 220

atg gtc ctt ctt gag ttt gta aca gct gct ggg att aca cat ggc atg 720
Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met
225 230 235 240

gat gaa ctg tac aac acc ggt atg tct aca ggt gtg cct tcg ggg tca 768
Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Val Pro Ser Gly Ser
245 250 255

agt gct gcc act ggc agt aat cga aga ctc cag cag aca caa aat caa 816
Ser Ala Ala Thr Gly Ser Asn Arg Arg Leu Gln Gln Thr Gln Asn Gln
260 265 270

gta gat gag gtg gtt gac atc atg aga gtc aat gtg gat aag gtg tta 864
Val Asp Glu Val Val Asp Ile Met Arg Val Asn Val Asp Lys Val Leu
275 280 285

gaa aga gac cag aag ctc tcg gag cta gat gac cgc gca gat gca ctg 912
Glu Arg Asp Gln Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu
290 295 300

cag gca ggt gcc tcg cag ttt gaa aca agt gct gcc aag ttg aag aga 960
Gln Ala Gly Ala Ser Gln Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg
305 310 315 320

aag tat tgg tgg aag aac tgc aag atg tgg gcg ata ggg atc agt gtc 1008
Lys Tyr Trp Trp Lys Asn Cys Lys Met Trp Ala Ile Gly Ile Ser Val
325 330 335

ctg gtg atc att gtc atc atc atc atc gtg tgg tgt gtc tct 1050
Leu Val Ile Ile Val Ile Ile Ile Ile Val Trp Cys Val Ser
340 345 350

taaatgagaa gaaaacgaca aaaggctagc aaaggagaag aactcttcac tggagttgtc 1110
ccaattcttg ttgaattaga tggtgatgtt aacggccaca agttctctgt cagtggagag 1170
ggtgaaggtg atgcaacata cggaaaactt accctgaagt tcatctgcac tactggcaaa 1230
ctgcctgttc catggccaac actagtcact actctgtgct atggtgttca atgcttttca 1290
agatacccgg atcatatgaa acggcatgac tttttcaaga gtgccatgcc cgaaggttat 1350
gtacaggaaa ggaccatctt cttcaaagat gacggcaact acaagacacg tgctgaagtc 1410
aagtttgaag gtgataccct tgttaataga atcgagttaa aaggtattga cttcaaggaa 1470
gatggcaaca ttctgggaca caaattggaa tacaactata actcacacaa tgtatacatc 1530
atggcagaca aacaaaagaa tggaatcaaa gtgaacttca agacccgcca caacattgaa 1590
gatggaagcg ttcaactagc agaccattat caacaaaata ctccaattgg cgatggccct 1650
gtccttttac cagacaacca ttacctgtcc acacaatctg ccctttcgaa agatcccaac 1710
gaaaagagag accacatggt ccttcttgag tttgtaacag ctgctgggat tacacatggc 1770
atggatgaac tgtacaacac cggtatgtct acaggtgtgc cttcggggtc aagtgctgcc 1830
actggcagta atcgaagact ccagcagaca caaaatcaag tagatgaggt ggttgacatc 1890
atgagagtca atgtggataa ggtgttagaa agagaccaga agctctcgga gctagatgac 1950
cgcgcagatg cactgcaggc aggtgcctcg cagtttgaaa caagtgctgc caagttgaag 2010
agaaagtatt ggtggaagaa ctgcaagatg tgggcgatag ggatcagtgt cctggtgatc 2070
attgtcatca tcatcatcgt gtggtgtgtc tcttaa 2106

<210> 30 
<211> 350 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
NLS-Fred25-cellubrevin construct 

<400> 30 

Met Arg Arg Lys Arg Gln Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr
1 5 10 15

Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His
20 25 30

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys
35 40 45

Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp
50 55 60

Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg
65 70 75 80

Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro
85 90 95

Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn
100 105 110

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn
115 120 125

Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu
130 135 140

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met
145 150 155 160

Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Thr Arg His
165 170 175

Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn
180 185 190

Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu
195 200 205

Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His
210 215 220

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His Gly Met
225 230 235 240

Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Val Pro Ser Gly Ser
245 250 255

Ser Ala Ala Thr Gly Ser Asn Arg Arg Leu Gln Gln Thr Gln Asn Gln
260 265 270

Val Asp Glu Val Val Asp Ile Met Arg Val Asn Val Asp Lys Val Leu
275 280 285

Glu Arg Asp Gln Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu
290 295 300

Gln Ala Gly Ala Ser Gln Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg
305 310 315 320

Lys Tyr Trp Trp Lys Asn Cys Lys Met Trp Ala Ile Gly Ile Ser Val
325 330 335

Leu Val Ile Ile Val Ile Ile Ile Ile Val Trp Cys Val Ser
340 345 350

<210> 31
<211> 3171
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)..(3168)

<220>
<223> Description of Artificial Sequence:
NLS-EYFP-MAPKDM-EBFP construct

<400> 31

2 ^ CC a9a aga aag gtg a 3° aa 9 99C gag gag ctg ttc acc ggg 48 
Met Arg Pro Arg Arg Lys Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
1 5 10 15 

gtg gtg ccc atc ctg gtc gag ctg gac ggc gac gta aac ggc cac aag 96
Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys
20 25 30

ttc agc gtg tcc ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg 144
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu
35 40 45

acc ctg aag ttc atc tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc 192
Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro
50 55 60

acc ctc gtg acc acc ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac 240
Thr Leu Val Thr Thr Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr
65 70 75 80

ccc gac cac atg aag cag cac gac ttc ttc aag tcc gcc atg ccc gaa 288
Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu
85 90 95

ggc tac gtc cag gag cgc acc atc ttc ttc aag gac gac ggc aac tac 336
Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr
100 105 110

aag acc cgc gcc gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc 384
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg
115 120 125

atc gag ctg aag ggc atc gac ttc aag gag gac ggc aac atc ctg ggg 432
Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly
130 135 140

cac aag ctg gag tac aac tac aac agc cac aac gtc tat atc atg gcc 480
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala
145 150 155 160

gac aag cag aag aac ggc atc aag gtg aac ttc aag atc cgc cac aac 528
Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn
165 170 175

atc gag gac ggc agc gtg cag ctc gcc gac cac tac cag cag aac acc 576
Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
180 185 190

ccc atc ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ctg agc 624
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
195 200 205

tac cag tcc gcc ctg agc aaa gac ccc aac gag aag cgc gat cac atg 672
Tyr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
210 215 220

gtc ctg ctg gag ttc gtg acc gcc gcc ggg atc act ctc ggc atg gac 720
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp
225 230 235 240

gag ctg tac aag aag gga gac gaa gtg gac gga gcc gac ctc agt ctt 768
Glu Leu Tyr Lys Lys Gly Asp Glu Val Asp Gly Ala Asp Leu Ser Leu
245 250 255

gtg gat gcg ttg aca gaa cca cct cca gaa att gag gga gaa ata aag 816
Val Asp Ala Leu Thr Glu Pro Pro Pro Glu Ile Glu Gly Glu Ile Lys
260 265 270

cga gac ttc atg gct gcg ctg gag gca gag ccc tat gat gac atc gtg 864
Arg Asp Phe Met Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp Ile Val
275 280 285

gga gaa act gtg gag aaa act gag ttt att cct ctc ctg gat ggt gat 912
Gly Glu Thr Val Glu Lys Thr Glu Phe Ile Pro Leu Leu Asp Gly Asp
290 295 300

gag aaa acc ggg aac tca gag tcc aaa aag aaa ccc tgc tta gac act 960
Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr
305 310 315 320

agc cag gtt gaa ggt atc cca tct tct aaa cca aca ctc cta gcc aat 1008
Ser Gln Val Glu Gly Ile Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn
325 330 335

ggt gat cat gga atg gag ggg aat aac act gca ggg tct cca act gac 1056
Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp
340 345 350

ttl f" ni* n, g 9t ? 93C tat CC9 9at tat ca 9 a 9 c aqc ca 9 aac H04 
Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr Gln Ser Ser Gln Asn
355 360 365

tgg cca gaa gat gca agc ttt tgt ttc cag cct cag caa gtg tta gat 1152
Trp Pro Glu Asp Ala Ser Phe Cys Phe Gln Pro Gln Gln Val Leu Asp
370 375 380

act gac cag gct gag ccc ttt aac gag cac cgt gat gat ggt ttg gca 1200
Thr Asp Gln Ala Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala
385 390 395 400

gat ctg ctc ttt gtc tcc agt gga ccc acg aac gct tct gca ttt aca 1248
Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr
405 410 415

gag cga gac aat cct tca gaa gac agt tac ggt atg ctt ccc tgt gac 1296
Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp
420 425 430

tca ttt gct tcc acg gct gtt gta tct cag gag tgg tct gtg gga gcc 1344
Ser Phe Ala Ser Thr Ala Val Val Ser Gln Glu Trp Ser Val Gly Ala
435 440 445

cca aac tct cca tgt tca gag tcc tgt gtc tcc cca gag gtt act ata 1392
Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr Ile
450 455 460

f£ C f ta Ca9 CCa 9Ca 3Ca 939 ctc tcc aaq qca 9 ca gtg gaa 1440 
Glu Thr Leu Gln Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu
465 470 475 480

tca gtg aaa gag cag ctg cca gct aaa gca ttg gaa acg atg gca gag 1488
Ser Val Lys Glu Gln Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu
485 490 495

n, 9 tu° ^ 93t 9t9 9t9 C3C tct Cca tcc aca qac aca aca cca ggc 1536 
Gln Thr Thr Asp Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly
500 505 510

cca gac aca gag gca gca ctg gct aaa gac ata gaa gag atc acc aag 1584
Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp Ile Glu Glu Ile Thr Lys
515 520 525

cca gat gtg ata ttg gca aat gtc acg cag cca tct act gaa tcg gat 1632
Pro Asp Val Ile Leu Ala Asn Val Thr Gln Pro Ser Thr Glu Ser Asp
530 535 540

atg ttc ctg gcc cag gac atg gaa cta ctc aca gga aca gag gca gcc 1680
Met Phe Leu Ala Gln Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala
545 550 555 560

cac gct aac aat atc ata ttg cct aca gaa cca gac gaa tct tca acc 1728
His Ala Asn Asn Ile Ile Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr
565 570 575

aag gat gta gca cca cct atg gaa gaa gaa att gtc cca ggc aat gat 1776
Lys Asp Val Ala Pro Pro Met Glu Glu Glu Ile Val Pro Gly Asn Asp
580 585 590

acg aca tcc ccc aaa gaa aca gag aca aca ctt cca ata aaa atg gac 1824
Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu Pro Ile Lys Met Asp
595 600 605

ttg gca cca cct gag gat gtg tta ctt acc aaa gaa aca gaa cta gcc 1872
Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala
610 615 620

cca gcc aag ggc atg gtt tca ctc tca gaa ata gaa gag gct ctg gca 1920
Pro Ala Lys Gly Met Val Ser Leu Ser Glu Ile Glu Glu Ala Leu Ala
625 630 635 640

aag aat gat gtt cgc tct gca gaa ata cct gtg gct cag gag aca gtg 1968
Lys Asn Asp Val Arg Ser Ala Glu Ile Pro Val Ala Gln Glu Thr Val
645 650 655

gtc tca gaa aca gag gtg gtc ctg gca aca gaa gtg gta ctg ccc tca 2016
Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser
660 665 670

gat ccc ata aca aca ttg aca aag gat gtg aca ctc ccc tta gaa gca 2064
Asp Pro Ile Thr Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala
675 680 685

gag aga ccg ttg gtg acg gac atg act cca tct ctg gaa aca gaa atg 2112
Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met
690 695 700

acc cta ggc aaa gag aca gct cca ccc aca gaa aca aat ttg ggc atg 2160
Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met
705 710 715 720

gtc aaa gac atg tct cca ctc cca gaa tca gaa gtg act ctg ggc aag 2208
Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys
725 730 735

gac gtg gtt ata ctt cca gaa aca aag gtg gct gag ttt aac aat gtg 2256
Asp Val Val Ile Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val
740 745 750

act cca ctt tca gaa gaa gag gta acc tca gtc aag gac atg tct ccg 2304
Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro
755 760 765

tct gca gaa aca gag gct ccc ctg gct aag aat gct gat ctg cac tca 2352
Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser
770 775 780

gga aca gag ctg att gtg gac aac agc atg gct cca gcc tcc gat ctt 2400
Gly Thr Glu Leu Ile Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu
785 790 795 800

gca ctg ccc ttg gaa aca aaa gta gca aca gtt cca att aaa gac aaa 2448
Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val Pro Ile Lys Asp Lys
805 810 815

gga atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc 2496
Gly Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile
820 825 830

ctg gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc 2544
Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser
835 840 845

ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc 2592
Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe
850 855 860

atc tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc 2640
Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr
865 870 875 880

acc ctg acc cac ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg 2688
Thr Leu Thr His Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met
885 890 895

aag cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag 2736
Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln
900 905 910

gag cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc 2784
Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala
915 920 925

gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag 2832
Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys
930 935 940

ggc atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag 2880
Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu
945 950 955 960

tac aac ttc aac agc cac aac gtc tat atc atg gcc gac aag cag aag 2928
Tyr Asn Phe Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys
965 970 975

aac ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc 2976
Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly
980 985 990

agc gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac 3024
Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp
995 1000 1005

ggc ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc 3072
Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala
1010 1015 1020

ctg agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag 3120
Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu
1025 1030 1035 1040

ttc gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag 3168
Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
1045 1050 1055

tag 3171

<210> 32
<211> 1056
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence:
NLS-EYFP-MAPKDM-EBFP construct

<400> 32

Met Arg Pro Arg Arg Lys Val Ser Lys Gly Glu Glu Leu Phe Thr Gly
1 5 10 15

Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys
20 25 30

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu
35 40 45

Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro
50 55 60

Thr Leu Val Thr Thr Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr
65 70 75 80

Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu
85 90 95

Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr
100 105 110

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg
115 120 125

Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly
130 135 140

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala
145 150 155 160

Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn
165 170 175

Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
180 185 190

Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
195 200 205

Tyr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
210 215 220

Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp
225 230 235 240

Glu Leu Tyr Lys Lys Gly Asp Glu Val Asp Gly Ala Asp Leu Ser Leu
245 250 255

Val Asp Ala Leu Thr Glu Pro Pro Pro Glu Ile Glu Gly Glu Ile Lys
260 265 270

Arg Asp Phe Met Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp Ile Val
275 280 285

Gly Glu Thr Val Glu Lys Thr Glu Phe Ile Pro Leu Leu Asp Gly Asp
290 295 300

Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr
305 310 315 320

Ser Gln Val Glu Gly Ile Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn
325 330 335

Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp
340 345 350

Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr Gln Ser Ser Gln Asn
355 360 365

Trp Pro Glu Asp Ala Ser Phe Cys Phe Gln Pro Gln Gln Val Leu Asp
370 375 380

Thr Asp Gln Ala Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala
385 390 395 400

Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr
405 410 415

Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp
420 425 430

Ser Phe Ala Ser Thr Ala Val Val Ser Gln Glu Trp Ser Val Gly Ala
435 440 445

Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr Ile
450 455 460

Glu Thr Leu Gln Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu
465 470 475 480

Ser Val Lys Glu Gln Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu
485 490 495

Gln Thr Thr Asp Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly
500 505 510

Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp Ile Glu Glu Ile Thr Lys
515 520 525

Pro Asp Val Ile Leu Ala Asn Val Thr Gln Pro Ser Thr Glu Ser Asp
530 535 540

Met Phe Leu Ala Gln Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala
545 550 555 560

His Ala Asn Asn Ile Ile Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr
565 570 575

Lys Asp Val Ala Pro Pro Met Glu Glu Glu Ile Val Pro Gly Asn Asp
580 585 590

Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu Pro Ile Lys Met Asp
595 600 605

Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala
610 615 620

Pro Ala Lys Gly Met Val Ser Leu Ser Glu Ile Glu Glu Ala Leu Ala
625 630 635 640

Lys Asn Asp Val Arg Ser Ala Glu Ile Pro Val Ala Gln Glu Thr Val
645 650 655

Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser
660 665 670

Asp Pro Ile Thr Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala
675 680 685

Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met
690 695 700

Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met
705 710 715 720

Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys
725 730 735

Asp Val Val Ile Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val
740 745 750

Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro
755 760 765

Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser
770 775 780

Gly Thr Glu Leu Ile Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu
785 790 795 800

Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val Pro Ile Lys Asp Lys
805 810 815

Gly Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile
820 825 830

Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser
835 840 845

Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe
850 855 860

Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr
865 870 875 880

Thr Leu Thr His Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met
885 890 895

Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln
900 905 910

Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala
915 920 925

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys
930 935 940

Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu
945 950 955 960

Tyr Asn Phe Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys
965 970 975

Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly
980 985 990

Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp
995 1000 1005

Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala
1010 1015 1020

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu
1025 1030 1035 1040

Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
1045 1050 1055

<210> 33
<211> 1623
<212> DNA
<213> Artificial Sequence

<220>
<221> CDS
<222> (1)..(1623)

<220>
<223> Description of Artificial Sequence:
YFP-NLS-CP3-multiple DEVD-CFP-Annexin II construct

<400> 33

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gec acc tac ggc aag ctg acc ctg aag ttc ate 144 
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Glu Gly Glu Gly Asp Ala Thr. Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ttc ggc tac ggc ctg. cag tgc ttc gec cgc tac ccc gac cac atg aag 240 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gec atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 g5 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gec gaq 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 no 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 ■ 140 

aac tac aac age cac aac gtc tat ate atg gec gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gec gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Glv 
180 185 i9o 



ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gec ctg 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
I 95 200 205 



220 



624 



age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 



gtg acc gec gec ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga aga agg aaa cga caa aag cga teg gca ggt gac gaa gtt gat gca 768 
Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Asp Glu Val Asp Ala 
245 250 255 

ggt gac gaa gtt gat gca ggt gac gaa gtt gat gca ggt gac gaa gtt 816 
Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 
260 265 270 

gac gca ggt agt act atg gtg age aag ggc gag gag ctg ttc acc ggg 864 
Asp Ala Gly Ser Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
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275 280 2 85 



gtg gtg ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc cac aag 912 
Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn lly His Lys 
290 295 300 



ttc age gtg tec ggc gag ggc gag ggc gat gec acc tac ggc aag ctg 
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
305 310 3X5 



320 



960 



1008 



acc ctg aag ttc ate tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc 
Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
325 330 335 

acc ctc gtg acc acc ctg acc tgg ggc gtg cag tgc ttc agc cgc tac 1056
Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gln Cys Phe Ser Arg Tyr
340 345 350

ccc gac cac atg aag cag cac gac ttc ttc aag tcc gcc atg ccc gaa 1104
Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu
355 360 365

ggc tac gtc cag gag cgc acc atc ttc ttc aag gac gac ggc aac tac 1152
Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr
370 375 380

aag acc cgc gcc gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc 1200
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg
385 390 395 400

atc gag ctg aag ggc atc gac ttc aag gag gac ggc aac atc ctg ggg 1248
Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly
405 410 415

cac aag ctg gag tac aac tac atc agc cac aac gtc tat atc acc gcc 1296
His Lys Leu Glu Tyr Asn Tyr Ile Ser His Asn Val Tyr Ile Thr Ala
420 425 430

gac aag cag aag aac ggc atc aag gcc aac ttc aag atc cgc cac aac 1344
Asp Lys Gln Lys Asn Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn
435 440 445

atc gag gac ggc agc gtg cag ctc gcc gac cac tac cag cag aac acc 1392
Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
450 455 460

ccc atc ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ctg agc 1440
Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
465 470 475 480

acc cag tcc gcc ctg agc aaa gac ccc aac gag aag cgc gat cac atg 1488
Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
485 490 495

gtc ctg ctg gag ttc gtg acc gcc gcc ggg atc act ctc ggc atg gac 1536
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp
500 505 510

gag ctg tac aag atg tct act gtc cac gaa atc ctg tgc aag ctc agc 1584
Glu Leu Tyr Lys Met Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser
515 520 525

ttg gag ggt gtt cat tct aca ccc cca agt gcc gga tcc
Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser
530 535 540



<210> 34 
<211> 541 
<212> PRT 
<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence:
YFP-NLS-CP3-multiple DEVD-CFP-Annexin II construct

<400> 34 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

Gly Arg Arg Lys Arg Gln Lys Arg Ser Ala Gly Asp Glu Val Asp Ala
245 250 255

Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val
260 265 270

Asp Ala Gly Ser Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly
275 280 285

Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys
290 295 300

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu
305 310 315 320

Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro
325 330 335

Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gln Cys Phe Ser Arg Tyr
340 345 350

Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu
355 360 365

Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr
370 375 380

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg
385 390 395 400

Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly
405 410 415

His Lys Leu Glu Tyr Asn Tyr Ile Ser His Asn Val Tyr Ile Thr Ala
420 425 430

Asp Lys Gln Lys Asn Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn
435 440 445

Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr
450 455 460

Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser
465 470 475 480

Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met
485 490 495

Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp
500 505 510

Glu Leu Tyr Lys Met Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser
515 520 525

Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser
530 535 540



<210> 35
<211> 24
<212> DNA
<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: FLAG epitope 
<400> 35 

gactacaaag acgacgacga caaa 

<210> 36 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG epitope 
<400> 36 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HA epitope 
<400> 37 

tacccatacg acgtaccaga ctacgca 

<210> 38 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HA epitope 
<400> 38 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 

<210> 39 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: KT3 epitope 
<400> 39

ccaccagaac cagaaaca

<210> 40
<211> 6
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: KT3 epitope 
<400> 40 

Pro Pro Glu Pro Glu Thr 
1 5 



<210> 41 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Myc epitope 
<400> 41 

gcagaagaac aaaaattaat aagcgaagaa gactta 36

<210> 42 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Myc epitope 
<400> 42 

Ala Glu Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu
1 5 10



<210> 43 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(717)

<220> 

<223> Description of Artificial Sequence: EYFP 
<400> 43 

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 240
Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc tac cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag 717
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235



<210> 44 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: EYFP 
<400> 44 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Phe Gly Tyr Gly Leu Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235

<210> 45 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)..(717)

<220> 

<223> Description of Artificial Sequence: EGFP 
<400> 45 

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg acc tac ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg aag 240
Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag 717
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235



<210> 46 
<211> 239 
<212> PRT 

<213> Artificial Sequence 




<220> 

<223> Description of Artificial Sequence: EGFP 
<400> 46 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235

<210> 47 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220>

<221> CDS
<222> (1)..(717)

<220>

<223> Description of Artificial Sequence: EBFP
<400> 47

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg acc cac ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg aag 240
Leu Thr His Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac ttc aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Phe Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag 717
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235






<210> 48 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: EBFP 
<400> 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr His Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Phe Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235



<210> 49
<211> 717
<212> DNA

<213> Artificial Sequence
<220>

<221> CDS

<222> (1)..(717)

<220> 

<223> Description of Artificial Sequence: ECFP 
<400> 49 

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg acc tgg ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg aag 240
Leu Thr Trp Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac atc agc cac aac gtc tat atc acc gcc gac aag cag aag aac 480
Asn Tyr Ile Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gcc aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag 717
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235



<210> 50 
<211> 239 
<212> PRT 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: ECFP 
<400> 50 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr Trp Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Ile Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Ala Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys
225 230 235

<210> 51
<211> 720 

<212> DNA
<213> Artificial Sequence

<220>

<221> CDS

<222> (1)..(717)

<220> 

<223> Description of Artificial Sequence: Fred25 
<400> 51 

atg gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt 48
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa 240
Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cgg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

agg acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtc aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

gga atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc 528
Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtt caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

cct gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

tcg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

9 ? t 9Ct ggg att aca cat ggc atg gat gaa ctg tac aac tag 720
Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn
225 230 235



<210> 52 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fred25 
<400> 52 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn
225 230 235

<210> 53
<211> 14
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-1, 4, 5 
substrate recognition sequence 



<400> 53 
tgggaacatg acaa 14

<210> 54
<211> 4
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Caspase-1, 4, 5
substrate recognition sequence

<400> 54

Trp Glu His Asp
1



<210> 55 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-1 
substrate recognition sequence 

<400> 55 
tggtttaaag ac 

<210> 56 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-1 
substrate recognition sequence

<400> 56

Trp Phe Lys Asp
1



<210> 57 
<211> 12 

<212> DNA
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 57 
gacgaacacg ac 

<210> 58 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 58 
Asp Glu His Asp 
1 



<210> 59 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 59 
gacgaagttg ac 

<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 60 
Asp Glu Val Asp 
1 



<210> 61 
<211> 12 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<400> 61 
atagaaacag ac 

<210> 62 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<400> 62 
Ile Glu Thr Asp
1 



<210> 63 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4,5
substrate recognition sequence 

<400> 63 
tgggtaagag ac 



<210> 64 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4,5
substrate recognition sequence 

<400> 64 
Trp Val Arg Asp 
1 

<210> 65 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 65 
gtagaaatag ac

<210> 66
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 66 
Val Glu Ile Asp
1 



<210> 67 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 67 
gtagaacacg ac 

<210> 68 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 68 
Val Glu His Asp 
1 



<210> 69 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-6
substrate recognition sequence 

<400> 69 
acagaagtag ac 

<210> 70 
<211> 4 
<212> PRT 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence 

<400> 70 
Thr Glu Val Asp 
1 

<210> 71 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 

<400> 71 
atacaagcag ac 

<210> 72 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 

<400> 72 
Ile Gln Ala Asp
1 



<210> 73 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
substrate recognition sequence 

<400> 73 
gtagaaacag ac 

<210> 74 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
substrate recognition sequence 

<400> 74 

Val Glu Thr Asp
1

<210> 75
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-8 
substrate recognition sequence 

<400> 75 
ttagaaacag ac 

<210> 76 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-8
substrate recognition sequence 

<400> 76 
Leu Glu Thr Asp 
1 

<210> 77 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition sequence 

<400> 77 
ttagaacacg ac 

<210> 78 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition sequence 

<400> 78 
Leu Glu His Asp 
1 



<210> 79 
<211> 12 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: proCaspase-9 
substrate recognition sequence 

<400> 79 

ttagaacacg ac 12 



<210> 80 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-9
substrate recognition sequence 

<400> 80 
Leu Glu His Asp 
1 



<210> 81 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 81 
agccaaaatt ac 



<210> 82 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 82 
Ser Gln Asn Tyr
1 



<210> 83 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 83 
ccaatagtac aa 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 57 
gacgaacacg ac 

<210> 58 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase- 2 
substrate recognition sequence 

<400> 58 
Asp Glu His Asp 
1 



<210> 59 

<211> 12 ( 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 59 
gacgaagttg ac 

<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 60 
Asp Glu Val Asp 
1 



<210> 61 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<400> 61 
atagaaacag ac 

<210> 62 
<211> 4 
<212> PRT 

<2X3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<400> 62 
Ile Glu Thr Asp
1 

<210> 63 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4,5
substrate recognition sequence 

<400> 63 
tgggtaagag ac 

<210> 64 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4,5
substrate recognition sequence 

<400> 64 
Trp Val Arg Asp 
1 



<210> 65 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 
<400> 65
gtagaaatag ac 



<210> 66 
<211> 4
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Caspase-6
substrate recognition sequence 



<400> 66 
Val Glu Ile Asp
1 



<210> 67 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 



<400> 67 
gtagaacacg ac 



<210> 68 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 68 

Val Glu His Asp 

1



<210> 69 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence 

<400> 69 
acagaagtag ac 



<210> 70
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-6
substrate recognition sequence 

<400> 70 
Thr Glu Val Asp 
1 



<210> 71 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 

<400> 71 
atacaagcag ac 

<210> 72 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 

<400> 72 
Ile Gln Ala Asp
1 



<210> 73 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
substrate recognition sequence 

<400> 73 
gtagaaacag ac 

<210> 74 
<211> 4 
<212> PRT

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: Caspase-8 
substrate recognition sequence 

<400> 74 

Val Glu Thr Asp
1 



<210> 75 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-8 
substrate recognition sequence 

<400> 75 
ttagaaacag ac 

<210> 76 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-8 
substrate recognition sequence 

<400> 76 
Leu Glu Thr Asp 
1 



<210> 77 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition sequence

<400> 77 
ttagaacacg ac 

<210> 78 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition sequence 

<400> 78 
Leu Glu His Asp 
1 




<210> 84 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 84 
Pro Ile Val Gln
1 



<210> 85 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence

<400> 85 
atgtttggag ga 

<210> 86 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence

<400> 86 

Met Phe Gly Gly
1

WO 00/50872



<210> 87
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 87 
gcaaaaaaaa ga 

<210> 88 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 88 
Ala Lys Lys Arg 
1 



<210> 89 

<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 89 
gtgaaaatg 

<210> 90 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 90 
Val Lys Met 

1

<210> 91

<211> 12 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 91 
gacgcagaat tc 

<210> 92 
<211> 4
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: b-Secretase
substrate recognition sequence

<400> 92 
Asp Ala Glu Phe 
1 



<210> 93 

<211> 15
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Cathepsin D
substrate recognition sequence

<400> 93 

aaaccagcat tattc

<210> 94

<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cathepsin D 
substrate recognition sequence

<400> 94 

Lys Pro Ala Leu Phe 
1 5 



<210> 95 
<211> 9 
<212> DNA 
<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Cathepsin D
substrate recognition sequence

<400> 95 
ttcagatta 

<210> 96 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cathepsin D 
substrate recognition sequence 

<400> 96 
Phe Arg Leu 
1 



<210> 97 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Matrix
Metalloprotease substrate recognition sequence

<400> 97 

ggaccattag gacca 

<210> 98 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Matrix 

Metalloprotease substrate recognition sequence 

<400> 98 

Gly Pro Leu Gly Pro 
1 5 



<210> 99 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence: Granzyme B
substrate recognition sequence 

<400> 99 
atagaaccag ac 

<210> 100 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Granzyme B
substrate recognition sequence 

<400> 100 
Ile Glu Pro Asp
1 

<210> 101 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400> 101 

atgcccaaga agaagccgac gcccatccag ctgaac 

<210> 102 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400> 102 

Met Pro Lys Lys Lys Pro Thr Pro Ile Gln Leu Asn
1 5 10

<210> 103
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 
<400> 103

atgctggccc ggaggaagcc ggtgctgccg gcgctcacca tcaac 

<210> 104 
<211> 15 
<212> PRT 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400> 104 

Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr Ile Asn
1 5 10 15



<210> 105 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulinum substrate recognition sequence

<400> 105 

gcctcgcagt ttgaaaca 



<210> 106 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulinum substrate recognition sequence

<400> 106 

Ala Ser Gln Phe Glu Thr
1 5 

<210> 107 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulinum substrate recognition sequence

<400> 107 

gcttctcaat ttgaaacg 
<210> 108
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulinum substrate recognition sequence

<400> 108 

Ala Ser Gln Phe Glu Thr
1 5 



<210> 109 
<211> 18 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin A substrate recognition sequence 

<400> 109 

gccaaccaac gtgcaaca 

<210> 110 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin A substrate recognition sequence 

<400> 110 

Ala Asn Gln Arg Ala Thr
1 5 

<210> 111 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin B substrate recognition sequence 

<400> 111 

gcttctcaat ttgaaacg 

<210> 112 
<211> 6 
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum
neurotoxin B substrate recognition sequence 

<400> 112 

Ala Ser Gln Phe Glu Thr
1 5 



<210> 113 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin C substrate recognition sequence 

<400> 113 

acgaaaaaag ctgtgaaa 



<210> 114 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin C substrate recognition sequence 

<400> 114 

Thr Lys Lys Ala Val Lys 
1 5 



<210> 115 
<211> 18
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin D substrate recognition sequence 

<400> 115

gaccagaagc tctctgag 

<210> 116 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Botulinum 
neurotoxin D substrate recognition sequence 

<400> 116 

Asp Gln Lys Leu Ser Glu
1 5 

<210> 117
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin E substrate recognition sequence 

<400> 117 

atcgacagga tcatggag 

<210> 118 
<211> 6
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin E substrate recognition sequence 

<400> 118 

Ile Asp Arg Ile Met Glu
1 5 

<210> 119 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin F substrate recognition sequence

<400> 119 

agagaccaga agctctct 

<210> 120 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin F substrate recognition sequence 
<400> 120

Arg Asp Gln Lys Leu Ser
1 5 

<210> 121 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin G substrate recognition sequence 

<400> 121 

acgagcgcag ccaagttg 

<210> 122 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin G substrate recognition sequence 

<400> 122 

Thr Ser Ala Ala Lys Leu 
1 5 

<210> 123 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Cytoplasm/cytoskeleton target sequence

<400> 123 

atgtctactg tccacgaaat cctgtgcaag ctcagcttgg agggtgttca ttctacaccc 60 
ccaagtgcc 69 

<210> 124 
<211> 23 
<212> PRT 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: 

Cytoplasm/cytoskeleton target sequence
<400> 124

Met Ser Thr Val His Glu Ile Leu Cys Lys Leu Ser Leu Glu Gly Val
1 5 10 15

His Ser Thr Pro Pro Ser Ala
20



<210> 125 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inner surface 
of plasma membrane target sequence 

<400> 125 

atgggatgta cattaagcgc agaagacaaa gcagcagtag aaagaagcaa aatgatagac 60 
agaaacttaa gagaagacgg agaaaaagct gctaga 96

<210> 126 
<211> 32 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inner surface 
of plasma membrane target sequence 

<400> 126 

Met Gly Cys Thr Leu Ser Ala Glu Asp Lys Ala Ala Val Glu Arg Ser 
1 5 10 15 

Lys Met Ile Asp Arg Asn Leu Arg Glu Asp Gly Glu Lys Ala Ala Arg
20 25 30 



<210> 127 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleus target 
sequence 

<400> 127 

agaaggaaac gacaaaag 18

<210> 128
<211> 6
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleus target 
sequence 

<400> 128 

Arg Arg Lys Arg Gln Lys
1 5 

<210> 129 
<211> 90 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleolus 
target sequence 

<400> 129 

agaaaacgta tacgtactta cctcaagtcc tgcaggcgga tgaaaagaag tggttttgag 60 
atgtctcgac ctattccttc ccaccttact 90 

<210> 130 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleolus 
target sequence 

<400> 130 

Arg Lys Arg Ile Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys Arg
1 5 10 15

Ser Gly Phe Glu Met Ser Arg Pro Ile Pro Ser His Leu Thr
20 25 30 

<210> 131 
<211> 87 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mitochondria 
target sequence 

<400> 131 

atgtccgtcc tgacgccgct gctgctgcgg ggcttgacag gctcggcccg gcggctccca 60 
gtgccgcgcg ccaagatcca ttcgttg 87



<210> 132 
<211> 29 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mitochondria 
target sequence 

<400> 132 

Met Ser Val Leu Thr Pro Leu Leu Leu Arg Gly Leu Thr Gly Ser Ala 
1 5 10 15

Arg Arg Leu Pro Val Pro Arg Ala Leu Ile His Ser Leu
20 25 



<210> 133 

<211> 99 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear 
Envelope target sequence 

<400> 133 

atgagcattg ttttaataat tgttattgtg gtgatttttt taatatgttt tttatattta 60 
agcaacagca aagatcccag agtaccagtt gaattaatg 99 



<210> 134 
<211> 33 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear 
Envelope target sequence 

<400> 134 

Met Ser Ile Val Leu Ile Ile Val Ile Val Val Ile Phe Leu Ile Cys
1 5 10 15 

Phe Leu Tyr Leu Ser Asn Ser Lys Asp Pro Arg Val Pro Val Glu Leu 
20 25 30 

Met 



<210> 135 
<211> 246 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Golgi target 
sequence 

<400> 135

atgaggcttc gggagccgct cctgagcggc agcgccgcga tgccaggcgc gtccctacag 60 

cgggcctgcc gcctgctcgt ggccgtctgc gctctgcacc ttggcgtcac cctcgtttac 120 

tacctggctg gccgcgacct gagccgcctg ccccaactgg tcggagtctc cacaccgctg 180 

cagggcggct cgaacagtgc cgccgccatc gggcagtcct ccggggagct ccggaccgga 240 

ggggcc 246 

<210> 136 
<211> 82 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Golgi target 
sequence 

<400> 136 

Met Arg Leu Arg Glu Pro Leu Leu Ser Gly Ser Ala Ala Met Pro Gly 
1 5 10 15

Ala Ser Leu Gln Arg Ala Cys Arg Leu Leu Val Ala Val Cys Ala Leu
20 25 30

His Leu Gly Val Thr Leu Val Tyr Tyr Leu Ala Gly Arg Asp Leu Ser
35 40 45

Arg Leu Pro Gln Leu Val Gly Val Ser Thr Pro Leu Gln Gly Gly Ser
50 55 60

Asn Ser Ala Ala Ala Ile Gly Gln Ser Ser Gly Glu Leu Arg Thr Gly
65 70 75 80 

Gly Ala 



<210> 137 
<211> 150 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Endoplasmic 
reticulum target sequence 
<400> 137

gaaacaataa gacctataag aataagaaga tgttcttatt ttacatctac agacagcaaa 60 

atggcaattc aattaagatc tccctttcca ttagcattac caggaatgtt agctttatta 120 

ggatggtggt ggtttttcag tagaaaaaaa 150 

<210> 138 
<211> 50
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Endoplasmic 
reticulum target sequence 

<400> 138 

Glu Thr Ile Arg Pro Ile Arg Ile Arg Arg Cys Ser Tyr Phe Thr Ser
1 5 10 15

Thr Asp Ser Lys Met Ala Ile Gln Leu Arg Ser Pro Phe Pro Leu Ala
20 25 30 

Leu Pro Gly Met Leu Ala Leu Leu Gly Trp Trp Trp Phe Phe Ser Arg 
35 40 45 

Lys Lys 
50 



<210> 139 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 139 

gccttgcaga agaagctgga ggagctagag cttgatgag 39 

<210> 140 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export
target sequence 



<400> 140 

Ala Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 
1 5 10 
<210> 141
<211> 1024 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Size exclusion
target sequence 

<400> 141 



gccgacctca gtcttgtgga tgcgttgaca gaaccacctc cagaaattga gggagaaata 60
aagcgagact tcatggctgc gctggaggca gagccctatg atgacatcgt aaaaaaaact 120
gtggagaaaa ctgagtttat tcctctcctg gatggtgatg agaaaaccgg gaactcagag 180
tccaaaaaga aaccctgctt agacactagc caggttgaag gtatcccatc ttctaaacca 240
acactcctag ccaatggtga tcatggaatg gaggggaata acactgcagg gtctccaact 300
gacttccttg aagagagagt ggactatccg gattatcaga gcagccagaa ctaaccaaaa 360
gatgcaagct tttgtttcca gcctcagcaa gtgttagata ctgaccaggc tgagcccttt 420
aacgagcacc gtgatgatgg tttggcagat ctgctctttg tctccagtgg acccacgaac 480
gcttctgcat ttacagagcg agacaatcct tcagaagaca gttacggtat gcttccctgt 540
gacccacttg cttccacggc tgttgtatct caggagtggt ctgtgggagc cccaaactct 600
ccatgttcag agtcctgtgt ctccccagag gttactatag aaaccctaca gccagcaaca 660
gagctctcca aggcagcaga agtggaatca gtgaaagagc agctgccagc taaagcattg 720
gaaacgatgg cagagcagac cactgatgtg gtgcactctc catccacaga cacaacacca 780
ggcccagaca cagaggcagc actggctaaa gacatagaag agatcaccaa gccagatgtg 840
atattggcaa atgtcacgca gccatctact gaatcggata tgttcctggc ccaggacatg 900
gaactactca caggaacaga ggcagcccac gctaacaata tcatattgcc tacagaacca 960
gacgaatctt caaccaagga tgtagcacca cctatggaag aagaaattgt cccaggcaat 1020
gata 1024

<210> 142 
<211> 566 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Size exclusion 
target sequence 
<400> 142

Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu Ile
1 5 10 15

Glu Gly Glu Ile Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu Pro
20 25 30

Tyr Asp Asp Ile Val Gly Glu Thr Val Glu Lys Thr Glu Phe Ile Pro
35 40 45

Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys
50 55 60

Pro Cys Leu Asp Thr Ser Gln Val Glu Gly Ile Pro Ser Ser Lys Pro
65 70 75 80

Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala
85 90 95

Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr
100 105 110

Gln Ser Ser Gln Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gln Pro
115 120 125

Gln Gln Val Leu Asp Thr Asp Gln Ala Glu Pro Phe Asn Glu His Arg
130 135 140

Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn
145 150 155 160

Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly
165 170 175

Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gln Glu
180 185 190

Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser
195 200 205

Pro Glu Val Thr Ile Glu Thr Leu Gln Pro Ala Thr Glu Leu Ser Lys
210 215 220

Ala Ala Glu Val Glu Ser Val Lys Glu Gln Leu Pro Ala Lys Ala Leu
225 230 235 240

Glu Thr Met Ala Glu Gln Thr Thr Asp Val Val His Ser Pro Ser Thr
245 250 255

Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp Ile
260 265 270

Glu Glu Ile Thr Lys Pro Asp Val Ile Leu Ala Asn Val Thr Gln Pro
275 280 285

Ser Thr Glu Ser Asp Met Phe Leu Ala Gln Asp Met Glu Leu Leu Thr
290 295 300
Gly Thr Glu Ala Ala His Ala Asn Asn Ile Ile Leu Pro Thr Glu Pro
305 310 315 320

Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu Ile
325 330 335

Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu
340 345 350

Pro Ile Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys
355 360 365

Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu Ile
370 375 380

Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu Ile Pro Val
385 390 395 400

Ala Gln Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu
405 410 415

Val Val Leu Pro Ser Asp Pro Ile Thr Thr Leu Thr Lys Asp Val Thr
420 425 430

Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser
435 440 445

Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu
450 455 460

Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu
465 470 475 480

Val Thr Leu Gly Lys Asp Val Val Ile Leu Pro Glu Thr Lys Val Ala
485 490 495

Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val
500 505 510

Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn
515 520 525

Ala Asp Leu His Ser Gly Thr Glu Leu Ile Val Asp Asn Ser Met Ala
530 535 540

Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val
545 550 555 560

Pro Ile Lys Asp Lys Gly
565



<210> 143 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 143 

atgtgggcaa tcgggattac tgttctggtt atcttcatca tcatcatcat cgtgtgggtt 60
gtc 63



<210> 144 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 144 

Met Trp Ala Ile Gly Ile Thr Val Leu Val Ile Phe Ile Ile Ile Ile
1 5 10 15

Ile Val Trp Val Val
20



<210> 145 
<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 145 

atgtgggcga tagggatcag tgtcctggtg atcattgtca tcatcatcat cgtgtggtgt 60
g 61

<210> 146 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 146 

Met Trp Ala Ile Gly Ile Ser Val Leu Val Ile Ile Val Ile Ile Ile
1 5 10 15

Ile Val Trp Cys
20

<210> 147 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 147 

gacctgcaga agaagctgga ggagctggaa cttgacgag 

<210> 148 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export
target sequence 

<400> 148 

Asp Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 
1 5 10



<210> 149 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peroxisome 
target sequence 

<400> 149 
tctaaactg 



<210> 150 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peroxisome 
target sequence 

<400> 150 
Ser Lys Leu 
1 
<210> 151

<211> 3378 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (1)..(3375)

<400> 151 

atg gcc gac ctc agt ctt gtg gat gcg ttg aca gaa cca cct cca gaa 48
Met Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu
1 5 10 15

att gag gga gaa ata aag cga gac ttc atg gct gcg ctg gag gca gag 96
Ile Glu Gly Glu Ile Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu
20 25 30

ccc tat gat gac atc gtg gga gaa act gtg gag aaa act gag ttt att 144
Pro Tyr Asp Asp Ile Val Gly Glu Thr Val Glu Lys Thr Glu Phe Ile
35 40 45

cct ctc ctg gat ggt gat gag aaa acc ggg aac tca gag tcc aaa aag 192
Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys
50 55 60

aaa ccc tgc tta gac act agc cag gtt gaa ggt atc cca tct tct aaa 240
Lys Pro Cys Leu Asp Thr Ser Gln Val Glu Gly Ile Pro Ser Ser Lys
65 70 75 80

cca aca ctc cta gcc aat ggt gat cat gga atg gag ggg aat aac act 288
Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr
85 90 95

gca ggg tct cca act gac ttc ctt gaa gag aga gtg gac tat ccg gat 336
Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp
100 105 110

tat cag agc agc cag aac tgg cca gaa gat gca agc ttt tgt ttc cag 384
Tyr Gln Ser Ser Gln Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gln
115 120 125

cct cag caa gtg tta gat act gac cag gct gag ccc ttt aac gag cac 432
Pro Gln Gln Val Leu Asp Thr Asp Gln Ala Glu Pro Phe Asn Glu His
130 135 140

cgt gat gat ggt ttg gca gat ctg ctc ttt gtc tcc agt gga ccc acg 480
Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr
145 150 155 160

aac gct tct gca ttt aca gag cga gac aat cct tca gaa gac agt tac 528
Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr
165 170 175

ggt atg ctt ccc tgt gac tca ttt gct tcc acg gct gtt gta tct cag 576
Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gln
180 185 190
gag tgg tct gtg gga gcc cca aac tct cca tgt tca gag tcc tgt gtc 624
Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val
195 200 205

tcc cca gag gtt act ata gaa acc cta cag cca gca aca gag ctc tcc 672
Ser Pro Glu Val Thr Ile Glu Thr Leu Gln Pro Ala Thr Glu Leu Ser
210 215 220

aag gca gca gaa gtg gaa tca gtg aaa gag cag ctg cca gct aaa gca 720
Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gln Leu Pro Ala Lys Ala
225 230 235 240

ttg gaa acg atg gca gag cag acc act gat gtg gtg cac tct cca tcc 768
Leu Glu Thr Met Ala Glu Gln Thr Thr Asp Val Val His Ser Pro Ser
245 250 255

aca gac aca aca cca ggc cca gac aca gag gca gca ctg gct aaa gac 816
Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp
260 265 270

ata gaa gag atc acc aag cca gat gtg ata ttg gca aat gtc acg cag 864
Ile Glu Glu Ile Thr Lys Pro Asp Val Ile Leu Ala Asn Val Thr Gln
275 280 285

cca tct act gaa tcg gat atg ttc ctg gcc cag gac atg gaa cta ctc 912
Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gln Asp Met Glu Leu Leu
290 295 300

aca gga aca gag gca gcc cac gct aac aat atc ata ttg cct aca gaa 960
Thr Gly Thr Glu Ala Ala His Ala Asn Asn Ile Ile Leu Pro Thr Glu
305 310 315 320

cca gac gaa tct tca acc aag gat gta gca cca cct atg gaa gaa gaa 1008
Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu
325 330 335

att gtc cca ggc aat gat acg aca tcc ccc aaa gaa aca gag aca aca 1056
Ile Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr
340 345 350

ctt cca ata aaa atg gac ttg gca cca cct gag gat gtg tta ctt acc 1104
Leu Pro Ile Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr
355 360 365

aaa gaa aca gaa cta gcc cca gcc aag ggc atg gtt tca ctc tca gaa 1152
Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu
370 375 380

ata gaa gag gct ctg gca aag aat gat gtt cgc tct gca gaa ata cct 1200
Ile Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu Ile Pro
385 390 395 400

gtg gct cag gag aca gtg gtc tca gaa aca gag gtg gtc ctg gca aca 1248
Val Ala Gln Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr
405 410 415
gaa gtg gta ctg ccc tca gat ccc ata aca aca ttg aca aag gat gtg 1296
Glu Val Val Leu Pro Ser Asp Pro Ile Thr Thr Leu Thr Lys Asp Val
420 425 430

aca ctc ccc tta gaa gca gag aga ccg ttg gtg acg gac atg act cca 1344
Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro
435 440 445

tct ctg gaa aca gaa atg acc cta ggc aaa gag aca gct cca ccc aca 1392
Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr
450 455 460

gaa aca aat ttg ggc atg gcc aaa gac atg tct cca ctc cca gaa tca 1440
Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser
465 470 475 480

gaa gtg act ctg ggc aag gac gtg gtt ata ctt cca gaa aca aag gtg 1488
Glu Val Thr Leu Gly Lys Asp Val Val Ile Leu Pro Glu Thr Lys Val
485 490 495

gct gag ttt aac aat gtg act cca ctt tca gaa gaa gag gta acc tca 1536
Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser
500 505 510

gtc aag gac atg tct ccg tct gca gaa aca gag gct ccc ctg gct aag 1584
Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys
515 520 525

aat gct gat ctg cac tca gga aca gag ctg att gtg gac aac agc atg 1632
Asn Ala Asp Leu His Ser Gly Thr Glu Leu Ile Val Asp Asn Ser Met
530 535 540

gct cca gcc tcc gat ctt gca ctg ccc ttg gaa aca aaa gta gca aca 1680
Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr
545 550 555 560

gtt cca att aaa gac aaa gga act gta cag act gaa gaa aaa cca cgt 1728
Val Pro Ile Lys Asp Lys Gly Thr Val Gln Thr Glu Glu Lys Pro Arg
565 570 575

gaa gac tcc cag tta gca tct atg cag cac aag gga cag tca aca gta 1776
Glu Asp Ser Gln Leu Ala Ser Met Gln His Lys Gly Gln Ser Thr Val
580 585 590

cct cct tgc acg gct tca cca gaa cca gtc aaa gct gca gaa caa atg 1824
Pro Pro Cys Thr Ala Ser Pro Glu Pro Val Lys Ala Ala Glu Gln Met
595 600 605

tct acc tta cca ata gat gca cct tct cca tta gag aac tta gag cag 1872
Ser Thr Leu Pro Ile Asp Ala Pro Ser Pro Leu Glu Asn Leu Glu Gln
610 615 620

aag gaa acg cct ggc agc cag cct tct gag cct tgc tca gga gta tcc 1920
Lys Glu Thr Pro Gly Ser Gln Pro Ser Glu Pro Cys Ser Gly Val Ser
625 630 635 640

cgg caa gaa gaa gca aag gct gct gta ggt gtg act gga aat gac atc 1968
Arg Gln Glu Glu Ala Lys Ala Ala Val Gly Val Thr Gly Asn Asp Ile
645 650 655

act acc ccg cca aac aag gag cca cca cca agc cca gaa aag aaa gca 2016
Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro Ser Pro Glu Lys Lys Ala
660 665 670



aag cct ttg gcc acc act caa cct gca aag act tca aca tcg aaa gcc 2064
Lys Pro Leu Ala Thr Thr Gln Pro Ala Lys Thr Ser Thr Ser Lys Ala
675 680 685

aaa aca cag ccc act tct ctc cct aag caa cca gct ccc acc acc tct 2112
Lys Thr Gln Pro Thr Ser Leu Pro Lys Gln Pro Ala Pro Thr Thr Ser
690 695 700

ggt ggg ttg aat aaa aaa ccc atg agc ctc gcc tca ggc tca gtg cca 2160
Gly Gly Leu Asn Lys Lys Pro Met Ser Leu Ala Ser Gly Ser Val Pro
705 710 715 720

gct gcc cca cac aaa cgc cct gct gct gcc act gct act gcc agg cct 2208
Ala Ala Pro His Lys Arg Pro Ala Ala Ala Thr Ala Thr Ala Arg Pro
725 730 735

tcc acc cta cct gcc aga gac gtg aag cca aag cca att aca gaa gct 2256
Ser Thr Leu Pro Ala Arg Asp Val Lys Pro Lys Pro Ile Thr Glu Ala
740 745 750

aag gtt gcc gaa aag cgg acc tct cca tcc aag cct tca tct gcc cca 2304
Lys Val Ala Glu Lys Arg Thr Ser Pro Ser Lys Pro Ser Ser Ala Pro
755 760 765

gcc ctc aaa cct gga cct aaa acc acc cca acc gtt tca aaa gcc aca 2352
Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro Thr Val Ser Lys Ala Thr
770 775 780

tct ccc tca act ctt gtt tcc act gga cca agt agt aga agt cca gct 2400
Ser Pro Ser Thr Leu Val Ser Thr Gly Pro Ser Ser Arg Ser Pro Ala
785 790 795 800

aca act ctg cct aag agg cca acc agc atc aag act gag ggg aaa cct 2448
Thr Thr Leu Pro Lys Arg Pro Thr Ser Ile Lys Thr Glu Gly Lys Pro
805 810 815

get gat gtc aaa agg atg act get aag tct gec tea get gac ttg agt 2496 
Ala Asp Val Lys Arg Met Thr Ala Lys Ser Ala Ser Ala Asp Leu Ser 
820 825 830 

cgc tea aag acc acc tct gec agt tct gtg aag aga aac acc act ccc 2544 
Arg Ser Lys Thr Thr Ser Ala Ser Ser Val Lys Arg Asn Thr Thr Pro 
835 840 845 

act ggg gca gca ccc cca gca ggg atg act tec act cga gtc aag ccc 2592 
Thr Gly Ala Ala Pro Pro Ala Gly Met Thr Ser Thr Arg Val Lys Pro 
850 855 860 

atg tct gca cct age cgc tct tct ggg get ctt tct gtg gac aag aag 2640 
Met Ser Ala Pro Ser Arg Ser Ser Gly Ala Leu Ser Val Asp Lys Lys 
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865 870 875 880 

ccc act tec act aag cct age tec tct get ccc agg gtg age cgc ctg 

Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala Pro Arg Val Ser Arg Leu 

885 890 895 



aat gca gtc act aaa gca gee ggc tec att gcg agt gca cag aaa ccg 
Asn Ala Val Thr Lys Ala Ala Gly Ser lie Ala Ser Ala Gin Lys Pro 
945 950 955 960 



tgt ggc aat gtt cag att cag aac aag aaa gtg gac ata tec aag gtc 
Cys Gly Asn Val Gin He Gin Asn Lys Lys Val Asp lie Ser Lys Val 
995 1000 1005 

tec tec aag tgt ggg tec aaa get aat ate aag cac aag cct ggt gga 
Ser Ser Lys Cys Gly Ser Lys Ala Asn He .Lys His Lys Pro Gly Gly 
1010 1015 1020 



2688 



gee aca act gtt tct gee cct gac ctg aag agt- gtt cgc tec aag gtc 2736 
Ala Thr Thr Val Ser Ala Pro Asp Leu Lys Ser Val Arg Ser Lys Val 
900 905 910 

ggc tct aca gaa aac ate aaa cac cag cct gga gga ggc egg gee aaa 2784 
Gly Ser Thr Glu Asn He Lys His Gin Pro Gly Gly Gly Arg Ala Lys 
915 920 925 

gta gag aaa aaa aca gag gca get ace aca get ggg aag cct gaa cct 2832 
Val Glu Lys Lys Thr Glu Ala Ala Thr Thr Ala Gly Lys Pro Glu Pro 
930 935 940 



2880 



cct get ggg aaa gtc cag ata gta tec aaa aaa gtg age tac agt cat 2928 

Pro Ala Gly Lys Val Gin He Val Ser Lys Lys Val Ser Tyr Ser His 
965 970 975 

att caa tec aag tgt gtt tec aag gac aat att aag cat gtc cct gga 2976 

He Gin Ser Lys Cys Val Ser Lys Asp Asn He Lys His Val Pro Gly 
980 985 990 



3024 



3072 



gga gat gtc aag att gaa agt cag aag ttg aac ttc aag gag aag gec 3120 
Gly Asp Val Lys He Glu Ser Gin Lys Leu Asn Phe Lys Glu Lys Ala 
1025 1030 1035 1040 

caa gcc aaa gtg gga tcc ctt gat aac gtt ggc cac ttt cct gca gga 3168
Gln Ala Lys Val Gly Ser Leu Asp Asn Val Gly His Phe Pro Ala Gly
1045 1050 1055

ggt gcc gtg aag act gag ggc ggt ggc agt gag gcc ctt ccg tgt cca 3216
Gly Ala Val Lys Thr Glu Gly Gly Gly Ser Glu Ala Leu Pro Cys Pro
1060 1065 1070

ggc ccc ccc gct ggg gag gag cca gtc atc cct gag gct gcg cct gac 3264
Gly Pro Pro Ala Gly Glu Glu Pro Val Ile Pro Glu Ala Ala Pro Asp
1075 1080 1085

cgt ggc gcc cct act tca gcc agt ggc ctc agt ggc cac acc acc ctg 3312
Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu Ser Gly His Thr Thr Leu
1090 1095 1100

tca ggg ggt ggt gac caa agg gag ccc cag acc ttg gac agc cag atc 3360
Ser Gly Gly Gly Asp Gln Arg Glu Pro Gln Thr Leu Asp Ser Gln Ile
1105 1110 1115 1120

cag gag aca agc atc taa 3378
Gln Glu Thr Ser Ile
1125



<210> 152 
<211> 1125 
<212> PRT 

<213> Mus musculus 
<400> 152 

Met Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu 
1 5 10 15 

Ile Glu Gly Glu Ile Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu
20 25 30

Pro Tyr Asp Asp Ile Val Gly Glu Thr Val Glu Lys Thr Glu Phe Ile
35 40 45

Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys
50 55 60

Lys Pro Cys Leu Asp Thr Ser Gln Val Glu Gly Ile Pro Ser Ser Lys
65 70 75 80

Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr
85 90 95

Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp
100 105 110

Tyr Gln Ser Ser Gln Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gln
115 120 125

Pro Gln Gln Val Leu Asp Thr Asp Gln Ala Glu Pro Phe Asn Glu His
130 135 140

Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr
145 150 155 160

Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr
165 170 175

Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gln
180 185 190

Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val
195 200 205

Ser Pro Glu Val Thr Ile Glu Thr Leu Gln Pro Ala Thr Glu Leu Ser
210 215 220

Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gln Leu Pro Ala Lys Ala
225 230 235 240

Leu Glu Thr Met Ala Glu Gln Thr Thr Asp Val Val His Ser Pro Ser
245 250 255

Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp
260 265 270

Ile Glu Glu Ile Thr Lys Pro Asp Val Ile Leu Ala Asn Val Thr Gln
275 280 285

Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gln Asp Met Glu Leu Leu
290 295 300

Thr Gly Thr Glu Ala Ala His Ala Asn Asn Ile Ile Leu Pro Thr Glu
305 310 315 320

Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu
325 330 335

Ile Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr
340 345 350

Leu Pro Ile Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr
355 360 365

Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu
370 375 380

Ile Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu Ile Pro
385 390 395 400

Val Ala Gln Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr
405 410 415

Glu Val Val Leu Pro Ser Asp Pro Ile Thr Thr Leu Thr Lys Asp Val
420 425 430

Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro 
435 440 445 

Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr 
450 455 460 



Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser 
465 470 475 480 

Glu Val Thr Leu Gly Lys Asp Val Val Ile Leu Pro Glu Thr Lys Val
485 490 495

Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser
500 505 510

Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys
515 520 525

Asn Ala Asp Leu His Ser Gly Thr Glu Leu Ile Val Asp Asn Ser Met
530 535 540

Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr
545 550 555 560

Val Pro Ile Lys Asp Lys Gly Thr Val Gln Thr Glu Glu Lys Pro Arg
565 570 575

Glu Asp Ser Gln Leu Ala Ser Met Gln His Lys Gly Gln Ser Thr Val
580 585 590

Pro Pro Cys Thr Ala Ser Pro Glu Pro Val Lys Ala Ala Glu Gln Met
595 600 605

Ser Thr Leu Pro Ile Asp Ala Pro Ser Pro Leu Glu Asn Leu Glu Gln
610 615 620

Lys Glu Thr Pro Gly Ser Gln Pro Ser Glu Pro Cys Ser Gly Val Ser
625 630 635 640

Arg Gln Glu Glu Ala Lys Ala Ala Val Gly Val Thr Gly Asn Asp Ile
645 650 655

Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro Ser Pro Glu Lys Lys Ala
660 665 670

Lys Pro Leu Ala Thr Thr Gln Pro Ala Lys Thr Ser Thr Ser Lys Ala
675 680 685

Lys Thr Gln Pro Thr Ser Leu Pro Lys Gln Pro Ala Pro Thr Thr Ser
690 695 700

Gly Gly Leu Asn Lys Lys Pro Met Ser Leu Ala Ser Gly Ser Val Pro
705 710 715 720

Ala Ala Pro His Lys Arg Pro Ala Ala Ala Thr Ala Thr Ala Arg Pro 
725 730 735 

Ser Thr Leu Pro Ala Arg Asp Val Lys Pro Lys Pro Ile Thr Glu Ala
740 745 750

Lys Val Ala Glu Lys Arg Thr Ser Pro Ser Lys Pro Ser Ser Ala Pro
755 760 765

Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro Thr Val Ser Lys Ala Thr
770 775 780

Ser Pro Ser Thr Leu Val Ser Thr Gly Pro Ser Ser Arg Ser Pro Ala
785 790 795 800

Thr Thr Leu Pro Lys Arg Pro Thr Ser Ile Lys Thr Glu Gly Lys Pro
805 810 815

Ala Asp Val Lys Arg Met Thr Ala Lys Ser Ala Ser Ala Asp Leu Ser
820 825 830

Arg Ser Lys Thr Thr Ser Ala Ser Ser Val Lys Arg Asn Thr Thr Pro
835 840 845 

Thr Gly Ala Ala Pro Pro Ala Gly Met Thr Ser Thr Arg Val Lys Pro 
850 855 860 

Met Ser Ala Pro Ser Arg Ser Ser Gly Ala Leu Ser Val Asp Lys Lys 
865 870 875 880 

Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala Pro Arg Val Ser Arg Leu 
885 890 895 

Ala Thr Thr Val Ser Ala Pro Asp Leu Lys Ser Val Arg Ser Lys Val 
900 905 910 

Gly Ser Thr Glu Asn Ile Lys His Gln Pro Gly Gly Gly Arg Ala Lys
915 920 925

Val Glu Lys Lys Thr Glu Ala Ala Thr Thr Ala Gly Lys Pro Glu Pro
930 935 940

Asn Ala Val Thr Lys Ala Ala Gly Ser Ile Ala Ser Ala Gln Lys Pro
945 950 955 960

Pro Ala Gly Lys Val Gln Ile Val Ser Lys Lys Val Ser Tyr Ser His
965 970 975

Ile Gln Ser Lys Cys Val Ser Lys Asp Asn Ile Lys His Val Pro Gly
980 985 990

Cys Gly Asn Val Gln Ile Gln Asn Lys Lys Val Asp Ile Ser Lys Val
995 1000 1005

Ser Ser Lys Cys Gly Ser Lys Ala Asn Ile Lys His Lys Pro Gly Gly
1010 1015 1020

Gly Asp Val Lys Ile Glu Ser Gln Lys Leu Asn Phe Lys Glu Lys Ala
1025 1030 1035 1040

Gln Ala Lys Val Gly Ser Leu Asp Asn Val Gly His Phe Pro Ala Gly
1045 1050 1055

Gly Ala Val Lys Thr Glu Gly Gly Gly Ser Glu Ala Leu Pro Cys Pro
1060 1065 1070

Gly Pro Pro Ala Gly Glu Glu Pro Val Ile Pro Glu Ala Ala Pro Asp
1075 1080 1085

Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu Ser Gly His Thr Thr Leu
1090 1095 1100

Ser Gly Gly Gly Asp Gln Arg Glu Pro Gln Thr Leu Asp Ser Gln Ile
1105 1110 1115 1120

Gln Glu Thr Ser Ile
1125

<210> 153
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 153 

tcatcatccg gagctggagc cggagctggc cgatcggctg ttaaatctga aggaaagaga 60 
aagtgtgacg aagttgatgg aattgatgaa gtagca 96 

<210> 154 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 154 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtacttgc tacttcatc 99 



<210> 155 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 155 

tcatcatccg gagctgga 

<210> 156 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 156 

gaagaaggat ccggcact 



WO 00/50872

PCT/US00/04794



<210> 157 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 157 

tcatcatccg gaagaaggaa acgacaaaag cgatcggctg ttaaatctga aggaaagaga 60 
aagtgtgacg aagttgatgg aattgatgaa gtagca 96 

<210> 158 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 158 

tcatcatccg gaagaagg 

<210> 159 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 159 

tcatcatccg gaagaaggaa acgacaaaag cgatcgacaa gacttgttga aattgacaac 60 

<210> 160 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 160 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtactgtt gtcaatttc 99 

<210> 161
<211> 84 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 161

tcatcatccg gaagaaggaa acgacaaaag cgatcgtatc aaaaaggaat accagttgaa 60 
acagacagcg aagagcaacc ttat 84 

<210> 162 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 162 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtactata aggttgctc 99 

<210> 163 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 163 

tcatcatccg gaagaaaacg tatacgtact tacctcaagt cctgcaggcg gatgaaaaga 60 

<210> 164 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 164 

gaagaacgat cgagtaaggt gggaaggaat aggtcgagac atctcaaaac cacttctttt 60 
cat 63 

<210> 165
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 165 

tcatcatccg gaagaaaa 



<210> 166 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 166 

gaagaacgat cgagtaag 

<210> 167 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-1,4,5 
substrate recognition sequence 

<400> 167 
ttagaacatg acaa

<210> 168
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-1, 4, 5 
substrate recognition sequence 

<400> 168 
Leu Glu His Asp 
1 

<210> 169 
<211> 1380 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: GFP-HSP27

<220>

<221> CDS

<222> (1)..(1380)

<400> 169

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg acc tac ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg aag 240
Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag tcc 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

gga ctc aga tct cga gcg gcg tcc aga gca gag tca gcc agc atg acc 768
Gly Leu Arg Ser Arg Ala Ala Ser Arg Ala Glu Ser Ala Ser Met Thr
245 250 255

gag cgc cgc gtc ccc ttc tcg ctc ctg cgg ggc ccc agc tgg gac ccc 816
Glu Arg Arg Val Pro Phe Ser Leu Leu Arg Gly Pro Ser Trp Asp Pro
260 265 270

ttc cgc gac tgg tac ccg cat agc cgc ctc ttc gac cag gcc ttc ggg 864
Phe Arg Asp Trp Tyr Pro His Ser Arg Leu Phe Asp Gln Ala Phe Gly
275 280 285

ctg ccc cgg ctg ccg gag gag tgg tcg cag tgg tta ggc ggc agc agc 912
Leu Pro Arg Leu Pro Glu Glu Trp Ser Gln Trp Leu Gly Gly Ser Ser
290 295 300

tgg cca ggc tac gtg cgc ccc ctg ccc ccc gcc gcc atc gag agc ccc 960
Trp Pro Gly Tyr Val Arg Pro Leu Pro Pro Ala Ala Ile Glu Ser Pro
305 310 315 320

gca gtg gcc gcg ccc gcc tac agc cgc gcg ctc agc cgg caa ctc agc 1008
Ala Val Ala Ala Pro Ala Tyr Ser Arg Ala Leu Ser Arg Gln Leu Ser
325 330 335

agc ggg gtc tcg gag atc cgg cac act gcg gac cgc tgg cgc gtg tcc 1056
Ser Gly Val Ser Glu Ile Arg His Thr Ala Asp Arg Trp Arg Val Ser
340 345 350

ctg gat gtc aac cac ttc gcc ccg gac gag ctg acg gtc aag acc aag 1104
Leu Asp Val Asn His Phe Ala Pro Asp Glu Leu Thr Val Lys Thr Lys
355 360 365

gat ggc gtg gtg gag atc acc ggc aag cac gag gag cgg cag gac gag 1152
Asp Gly Val Val Glu Ile Thr Gly Lys His Glu Glu Arg Gln Asp Glu
370 375 380

cat ggc tac atc tcc cgg tgc ttc acg cgg aaa tac acg ctg ccc ccc 1200
His Gly Tyr Ile Ser Arg Cys Phe Thr Arg Lys Tyr Thr Leu Pro Pro
385 390 395 400

ggt gtg gac ccc acc caa gtt tcc tcc tcc ctg tcc cct gag ggc aca 1248
Gly Val Asp Pro Thr Gln Val Ser Ser Ser Leu Ser Pro Glu Gly Thr
405 410 415

ctg acc gtg gag gcc ccc atg ccc aag cta gcc acg cag tcc aac gag 1296
Leu Thr Val Glu Ala Pro Met Pro Lys Leu Ala Thr Gln Ser Asn Glu
420 425 430

atc acc atc cca gtc acc ttc gag tcg cgg gcc cag ctt ggg ggc cca 1344
Ile Thr Ile Pro Val Thr Phe Glu Ser Arg Ala Gln Leu Gly Gly Pro
435 440 445

gaa gct gca aaa tcc gat gag act gcc gcc aag taa 1380
Glu Ala Ala Lys Ser Asp Glu Thr Ala Ala Lys
450 455 460



<210> 170 
<211> 459 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSP27 
<400> 170 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

Gly Leu Arg Ser Arg Ala Ala Ser Arg Ala Glu Ser Ala Ser Met Thr 
245 250 255 

Glu Arg Arg Val Pro Phe Ser Leu Leu Arg Gly Pro Ser Trp Asp Pro 
260 265 270 

Phe Arg Asp Trp Tyr Pro His Ser Arg Leu Phe Asp Gln Ala Phe Gly
275 280 285

Leu Pro Arg Leu Pro Glu Glu Trp Ser Gln Trp Leu Gly Gly Ser Ser
290 295 300

Trp Pro Gly Tyr Val Arg Pro Leu Pro Pro Ala Ala Ile Glu Ser Pro
305 310 315 320

Ala Val Ala Ala Pro Ala Tyr Ser Arg Ala Leu Ser Arg Gln Leu Ser
325 330 335

Ser Gly Val Ser Glu Ile Arg His Thr Ala Asp Arg Trp Arg Val Ser
340 345 350

Leu Asp Val Asn His Phe Ala Pro Asp Glu Leu Thr Val Lys Thr Lys
355 360 365

Asp Gly Val Val Glu Ile Thr Gly Lys His Glu Glu Arg Gln Asp Glu
370 375 380

His Gly Tyr Ile Ser Arg Cys Phe Thr Arg Lys Tyr Thr Leu Pro Pro
385 390 395 400

Gly Val Asp Pro Thr Gln Val Ser Ser Ser Leu Ser Pro Glu Gly Thr
405 410 415

Leu Thr Val Glu Ala Pro Met Pro Lys Leu Ala Thr Gln Ser Asn Glu
420 425 430

Ile Thr Ile Pro Val Thr Phe Glu Ser Arg Ala Gln Leu Gly Gly Pro
435 440 445 

Glu Ala Ala Lys Ser Asp Glu Thr Ala Ala Lys 
450 455 



<210> 171 
<211> 2823 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: GFP-HSP70

<220>

<221> CDS

<222> (1)..(2823)

<400> 171

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg acc tac ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg aag 240
Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag tcc 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

gga atg tcg gtg gtg ggc ata gac ctg ggc ttc cag agc tgc tac gtc 768
Gly Met Ser Val Val Gly Ile Asp Leu Gly Phe Gln Ser Cys Tyr Val
245 250 255

gct gtg gcc cgc gcc ggc ggc atc gag act atc gct aat gag tat agc 816
Ala Val Ala Arg Ala Gly Gly Ile Glu Thr Ile Ala Asn Glu Tyr Ser
260 265 270

gac cgc tgc acg ccg gct tgc att tct ttt ggt cct aag aat cgt tca 864
Asp Arg Cys Thr Pro Ala Cys Ile Ser Phe Gly Pro Lys Asn Arg Ser
275 280 285

att gga gca gca gct aaa agc cag gta att tct aat gca aag aac aca 912
Ile Gly Ala Ala Ala Lys Ser Gln Val Ile Ser Asn Ala Lys Asn Thr
290 295 300

gtc caa gga ttt aaa aga ttc cat ggc cga gca ttc tct gat cca ttt 960
Val Gln Gly Phe Lys Arg Phe His Gly Arg Ala Phe Ser Asp Pro Phe
305 310 315 320

gtg gag gca gaa aaa tct aac ctt gca tat gat att gtg cag tgg cct 1008
Val Glu Ala Glu Lys Ser Asn Leu Ala Tyr Asp Ile Val Gln Trp Pro
325 330 335

aca gga tta aca ggt ata aag gtg aca tat atg gag gaa gag cga aat 1056
Thr Gly Leu Thr Gly Ile Lys Val Thr Tyr Met Glu Glu Glu Arg Asn
340 345 350

ttt acc act gag caa gtg act gcc atg ctt ttg tcc aaa ctg aag gag 1104
Phe Thr Thr Glu Gln Val Thr Ala Met Leu Leu Ser Lys Leu Lys Glu
355 360 365

aca gcc gaa agt gtt ctt aag aag cct gta gtt gac tgt gtt gtt tcg 1152
Thr Ala Glu Ser Val Leu Lys Lys Pro Val Val Asp Cys Val Val Ser
370 375 380

gtt cct tgt ttc tat act gat gca gaa aga cga tca gtg atg gat gca 1200
Val Pro Cys Phe Tyr Thr Asp Ala Glu Arg Arg Ser Val Met Asp Ala
385 390 395 400

aca cag att gct ggt ctt aat tgc ttg cga tta atg aat gaa acc act 1248
Thr Gln Ile Ala Gly Leu Asn Cys Leu Arg Leu Met Asn Glu Thr Thr
405 410 415

gca gtt gct ctt gca tat gga atc tat aag cag gat ctt cct cgc tta 1296
Ala Val Ala Leu Ala Tyr Gly Ile Tyr Lys Gln Asp Leu Pro Arg Leu
420 425 430

gaa gag aaa cca aga aat gta gtt ttt gta gac atg ggc cac tct gct 1344
Glu Glu Lys Pro Arg Asn Val Val Phe Val Asp Met Gly His Ser Ala
435 440 445

tat caa gtt tct gta tgt gca ttt aat aga gga aaa ctg aaa gtt ctg 1392
Tyr Gln Val Ser Val Cys Ala Phe Asn Arg Gly Lys Leu Lys Val Leu
450 455 460

gcc act gca ttt gac acg aca ttg gga ggt aga aaa ttt gat gaa gtg 1440
Ala Thr Ala Phe Asp Thr Thr Leu Gly Gly Arg Lys Phe Asp Glu Val
465 470 475 480

tta gta aat cac ttc tgt gaa gaa ttt ggg aag aaa tac aag cta gac 1488
Leu Val Asn His Phe Cys Glu Glu Phe Gly Lys Lys Tyr Lys Leu Asp
485 490 495

att aag tcc aaa atc cgt gca tta tta cga ctc tct cag gag tgt gag 1536
Ile Lys Ser Lys Ile Arg Ala Leu Leu Arg Leu Ser Gln Glu Cys Glu
500 505 510

aaa ctc aag aaa ttg atg agt gca aat gct tca gat ctc cct ttg agc 1584
Lys Leu Lys Lys Leu Met Ser Ala Asn Ala Ser Asp Leu Pro Leu Ser
515 520 525

att gaa tgt ttt atg aat gat gtt gat gta tct gga act atg aat aga 1632
Ile Glu Cys Phe Met Asn Asp Val Asp Val Ser Gly Thr Met Asn Arg
530 535 540

ggc aaa ttt ctg gag atg tgc aat gat ctc tta gct aga gtg gag cca 1680
Gly Lys Phe Leu Glu Met Cys Asn Asp Leu Leu Ala Arg Val Glu Pro
545 550 555 560

cca ctt cgt agt gtt ttg gaa caa acc aag tta aag aaa gaa gat att 1728
Pro Leu Arg Ser Val Leu Glu Gln Thr Lys Leu Lys Lys Glu Asp Ile
565 570 575

tat gca gtg gag ata gtt ggt ggt gct aca cga atc cct gcg gta aaa 1776
Tyr Ala Val Glu Ile Val Gly Gly Ala Thr Arg Ile Pro Ala Val Lys
580 585 590

gag aag atc agc aaa ttt ttc ggt aaa gaa ctt agt aca aca tta aat 1824
Glu Lys Ile Ser Lys Phe Phe Gly Lys Glu Leu Ser Thr Thr Leu Asn
595 600 605

gct gat gaa gct gtc act cga ggc tgt gca ttg cag tgt gcc atc tta 1872
Ala Asp Glu Ala Val Thr Arg Gly Cys Ala Leu Gln Cys Ala Ile Leu
610 615 620

tcg cct gct ttc aaa gtc aga gaa ttt tct atc act gat gta gta cca 1920
Ser Pro Ala Phe Lys Val Arg Glu Phe Ser Ile Thr Asp Val Val Pro
625 630 635 640

tat cca ata tct ctg aga tgg aat tct cca gct gaa gaa ggg tca agt 1968
Tyr Pro Ile Ser Leu Arg Trp Asn Ser Pro Ala Glu Glu Gly Ser Ser
645 650 655

gac tgt gaa gtc ttt tcc aaa aat cat gct gct cct ttc tct aaa gtt 2016
Asp Cys Glu Val Phe Ser Lys Asn His Ala Ala Pro Phe Ser Lys Val
660 665 670

ctt aca ttt tat aga aag gaa cct ttc act ctt gag gcc tac tac agc 2064
Leu Thr Phe Tyr Arg Lys Glu Pro Phe Thr Leu Glu Ala Tyr Tyr Ser
675 680 685

tct cct cag gat ttg ccc tat cca gat cct gct ata gct cag ttt tca 2112
Ser Pro Gln Asp Leu Pro Tyr Pro Asp Pro Ala Ile Ala Gln Phe Ser
690 695 700

gtt cag aaa gtc act cct cag tct gat ggc tcc agt tca aaa gtg aaa 2160
Val Gln Lys Val Thr Pro Gln Ser Asp Gly Ser Ser Ser Lys Val Lys
705 710 715 720

gtc aaa gtt cga gta aat gtc cat ggc att ttc agt gtg tcc agt gca 2208
Val Lys Val Arg Val Asn Val His Gly Ile Phe Ser Val Ser Ser Ala
725 730 735

tct tta gtg gag gtt cac aag tct gag gaa aat gag gag cca atg gaa 2256
Ser Leu Val Glu Val His Lys Ser Glu Glu Asn Glu Glu Pro Met Glu
740 745 750

aca gat cag aat gca aag gag gaa gag aag atg caa gtg gac cag gag 2304
Thr Asp Gln Asn Ala Lys Glu Glu Glu Lys Met Gln Val Asp Gln Glu
755 760 765

gaa cca cat gtt gaa gag caa cag cag cag aca cca gca gaa aat aag 2352
Glu Pro His Val Glu Glu Gln Gln Gln Gln Thr Pro Ala Glu Asn Lys
770 775 780

gca gag tct gaa gaa atg gag acc tct caa gct gga tcc aag gat aaa 2400
Ala Glu Ser Glu Glu Met Glu Thr Ser Gln Ala Gly Ser Lys Asp Lys
785 790 795 800

aag atg gac caa cca ccc caa tgc caa gaa ggc aaa agt gaa gac cag 2448
Lys Met Asp Gln Pro Pro Gln Cys Gln Glu Gly Lys Ser Glu Asp Gln
805 810 815

tac tgt gga cct gcc aat cga gaa tca gct ata tgg cag ata gac aga 2496
Tyr Cys Gly Pro Ala Asn Arg Glu Ser Ala Ile Trp Gln Ile Asp Arg
820 825 830

gag atg ctc aac ttg tac att gaa aat gag ggt aag atg atc atg cag 2544
Glu Met Leu Asn Leu Tyr Ile Glu Asn Glu Gly Lys Met Ile Met Gln
835 840 845

gat aaa ctg gag aag gag cgg aat gat gct aag aac gca gtg gag gaa 2592
Asp Lys Leu Glu Lys Glu Arg Asn Asp Ala Lys Asn Ala Val Glu Glu
850 855 860

tat gtg tat gaa atg aga gac aag ctt agt ggt gaa tat gag aag ttt 2640
Tyr Val Tyr Glu Met Arg Asp Lys Leu Ser Gly Glu Tyr Glu Lys Phe
865 870 875 880

gtg agt gaa gat gat cgt aac agt ttt act ttg aaa ctg gaa gat act 2688
Val Ser Glu Asp Asp Arg Asn Ser Phe Thr Leu Lys Leu Glu Asp Thr
885 890 895

gaa aat tgg ttg tat gag gat gga gaa gac cag cca aag caa gtt tat 2736
Glu Asn Trp Leu Tyr Glu Asp Gly Glu Asp Gln Pro Lys Gln Val Tyr
900 905 910

gtt gat aag ttg gct gaa tta aaa aat cta ggt caa cct att aag ata 2784
Val Asp Lys Leu Ala Glu Leu Lys Asn Leu Gly Gln Pro Ile Lys Ile
915 920 925

cgt ttc cag gaa tct gaa gaa cga cca aat tat ttg aag 2823
Arg Phe Gln Glu Ser Glu Glu Arg Pro Asn Tyr Leu Lys
930 935 940



<210> 172 

<211> 941 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSP70 

<400> 172 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

Gly Met Ser Val Val Gly Ile Asp Leu Gly Phe Gln Ser Cys Tyr Val
245 250 255

Ala Val Ala Arg Ala Gly Gly Ile Glu Thr Ile Ala Asn Glu Tyr Ser
260 265 270

Asp Arg Cys Thr Pro Ala Cys Ile Ser Phe Gly Pro Lys Asn Arg Ser
275 280 285

Ile Gly Ala Ala Ala Lys Ser Gln Val Ile Ser Asn Ala Lys Asn Thr
290 295 300

Val Gln Gly Phe Lys Arg Phe His Gly Arg Ala Phe Ser Asp Pro Phe
305 310 315 320

Val Glu Ala Glu Lys Ser Asn Leu Ala Tyr Asp Ile Val Gln Trp Pro
325 330 335

Thr Gly Leu Thr Gly Ile Lys Val Thr Tyr Met Glu Glu Glu Arg Asn
340 345 350

Phe Thr Thr Glu Gln Val Thr Ala Met Leu Leu Ser Lys Leu Lys Glu
355 360 365

Thr Ala Glu Ser Val Leu Lys Lys Pro Val Val Asp Cys Val Val Ser
370 375 380

Val Pro Cys Phe Tyr Thr Asp Ala Glu Arg Arg Ser Val Met Asp Ala
385 390 395 400

Thr Gln Ile Ala Gly Leu Asn Cys Leu Arg Leu Met Asn Glu Thr Thr
405 410 415

Ala Val Ala Leu Ala Tyr Gly Ile Tyr Lys Gln Asp Leu Pro Arg Leu
420 425 430

Glu Glu Lys Pro Arg Asn Val Val Phe Val Asp Met Gly His Ser Ala
435 440 445



Tyr Gln Val Ser Val Cys Ala Phe Asn Arg Gly Lys Leu Lys Val Leu
450 455 460

Ala Thr Ala Phe Asp Thr Thr Leu Gly Gly Arg Lys Phe Asp Glu Val
465 470 475 480

Leu Val Asn His Phe Cys Glu Glu Phe Gly Lys Lys Tyr Lys Leu Asp
485 490 495

Ile Lys Ser Lys Ile Arg Ala Leu Leu Arg Leu Ser Gln Glu Cys Glu
500 505 510

Lys Leu Lys Lys Leu Met Ser Ala Asn Ala Ser Asp Leu Pro Leu Ser
515 520 525

Ile Glu Cys Phe Met Asn Asp Val Asp Val Ser Gly Thr Met Asn Arg
530 535 540

Gly Lys Phe Leu Glu Met Cys Asn Asp Leu Leu Ala Arg Val Glu Pro
545 550 555 560

Pro Leu Arg Ser Val Leu Glu Gln Thr Lys Leu Lys Lys Glu Asp Ile
565 570 575

Tyr Ala Val Glu Ile Val Gly Gly Ala Thr Arg Ile Pro Ala Val Lys
580 585 590

Glu Lys Ile Ser Lys Phe Phe Gly Lys Glu Leu Ser Thr Thr Leu Asn
595 600 605

Ala Asp Glu Ala Val Thr Arg Gly Cys Ala Leu Gln Cys Ala Ile Leu
610 615 620

Ser Pro Ala Phe Lys Val Arg Glu Phe Ser Ile Thr Asp Val Val Pro
625 630 635 640

Tyr Pro Ile Ser Leu Arg Trp Asn Ser Pro Ala Glu Glu Gly Ser Ser
645 650 655

Asp Cys Glu Val Phe Ser Lys Asn His Ala Ala Pro Phe Ser Lys Val 
660 665 670 

Leu Thr Phe Tyr Arg Lys Glu Pro Phe Thr Leu Glu Ala Tyr Tyr Ser 
675 680 685 

Ser Pro Gln Asp Leu Pro Tyr Pro Asp Pro Ala Ile Ala Gln Phe Ser
690 695 700

Val Gln Lys Val Thr Pro Gln Ser Asp Gly Ser Ser Ser Lys Val Lys
705 710 715 720

Val Lys Val Arg Val Asn Val His Gly Ile Phe Ser Val Ser Ser Ala
725 730 735

Ser Leu Val Glu Val His Lys Ser Glu Glu Asn Glu Glu Pro Met Glu
740 745 750

Thr Asp Gln Asn Ala Lys Glu Glu Glu Lys Met Gln Val Asp Gln Glu
755 760 765

Glu Pro His Val Glu Glu Gln Gln Gln Gln Thr Pro Ala Glu Asn Lys
770 775 780

Ala Glu Ser Glu Glu Met Glu Thr Ser Gln Ala Gly Ser Lys Asp Lys
785 790 795 800

Lys Met Asp Gln Pro Pro Gln Cys Gln Glu Gly Lys Ser Glu Asp Gln
805 810 815

Tyr Cys Gly Pro Ala Asn Arg Glu Ser Ala Ile Trp Gln Ile Asp Arg
820 825 830

Glu Met Leu Asn Leu Tyr Ile Glu Asn Glu Gly Lys Met Ile Met Gln
835 840 845

Asp Lys Leu Glu Lys Glu Arg Asn Asp Ala Lys Asn Ala Val Glu Glu
850 855 860

Tyr Val Tyr Glu Met Arg Asp Lys Leu Ser Gly Glu Tyr Glu Lys Phe
865 870 875 880

Val Ser Glu Asp Asp Arg Asn Ser Phe Thr Leu Lys Leu Glu Asp Thr
885 890 895

Glu Asn Trp Leu Tyr Glu Asp Gly Glu Asp Gln Pro Lys Gln Val Tyr
900 905 910

Val Asp Lys Leu Ala Glu Leu Lys Asn Leu Gly Gln Pro Ile Lys Ile
915 920 925

Arg Phe Gln Glu Ser Glu Glu Arg Pro Asn Tyr Leu Lys
930 935 940



<210> 173
<211> 2674
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: GFP-HSC70

<220>

<221> CDS

<222> (1)..(2673)

<400> 173 

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg acc tac ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg aag 240
Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag tcc 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

gga ctc aga tct atg tcc aag gga cct gca gtt ggt att gat ctt ggc 768
Gly Leu Arg Ser Met Ser Lys Gly Pro Ala Val Gly Ile Asp Leu Gly
245 250 255

acc acc tac tct tgt gtg ggt gtt ttc cag cac gga aaa gtc gag ata 816
Thr Thr Tyr Ser Cys Val Gly Val Phe Gln His Gly Lys Val Glu Ile
260 265 270

att gcc aat gat cag gga aac cga acc act cca agc tat gtc gcc ttt 864
Ile Ala Asn Asp Gln Gly Asn Arg Thr Thr Pro Ser Tyr Val Ala Phe
275 280 285

acg gac act gaa cgg ttg atc ggt gat gcc gca aag aat caa gtt gca 912
Thr Asp Thr Glu Arg Leu Ile Gly Asp Ala Ala Lys Asn Gln Val Ala
290 295 300

atg aac ccc acc aac aca gtt ttt gat gcc aaa cgt ctg att gga cgc 960
Met Asn Pro Thr Asn Thr Val Phe Asp Ala Lys Arg Leu Ile Gly Arg
305 310 315 320

aga ttt gat gat gct gtt gtc cag tct gat atg aaa cat tgg ccc ttt 1008
Arg Phe Asp Asp Ala Val Val Gln Ser Asp Met Lys His Trp Pro Phe
325 330 335

tl? r? If 1 ? aat 93t 9Ct 9gC 399 CCC aag 9tc caa 9ta 9 aa ta <= aag 1056 
Met Val Val Asn Asp Ala Gly Arg Pro Lys Val Gln Val Glu Tyr Lys
340 345 350

r?f, Hi 9 ag ° ttC tat CCa 939 339 9t9 tct tct atg 3" ctg H04 

Gly Glu Thr Lys Ser Phe Tyr Pro Glu Glu Val Ser Ser Met Val Leu
355 360 365

aca aag atg aag gaa att gca gaa gcc tac ctt ggg aag act gtt acc 1152
Thr Lys Met Lys Glu Ile Ala Glu Ala Tyr Leu Gly Lys Thr Val Thr
370 375 380

aat gct gtg gtc aca gtg cca gct tac ttt aat gac tct cag cgt cag 1200
Asn Ala Val Val Thr Val Pro Ala Tyr Phe Asn Asp Ser Gln Arg Gln
385 390 395 400

gct acc aaa gat gct gga act att gct ggt ctc aat gta ctt aga att 1248
Ala Thr Lys Asp Ala Gly Thr Ile Ala Gly Leu Asn Val Leu Arg Ile
405 410 415

att aat gag cca act gct gct gct att gct tac ggc tta gac aaa aag 1296
Ile Asn Glu Pro Thr Ala Ala Ala Ile Ala Tyr Gly Leu Asp Lys Lys
420 425 430

S fc ? o? 3 f? 3 9 f a 393 aaC 9tg CtC atc tfct 9ac ct 9 99 a ggt ggc act 1344 
Val Gly Ala Glu Arg Asn Val Leu Ile Phe Asp Leu Gly Gly Gly Thr
435 440 445 

111 f at T? ! ca a ^ c ctc act att gag 9at 3s a atc ttfc gag gtc aag 1392 

Phe Asp Val Ser Ile Leu Thr Ile Glu Asp Gly Ile Phe Glu Val Lys
450 455 460 



tct aca gct gga gac acc cac ttg ggt gga gaa gat ttt gac aac cga 1440
Ser Thr Ala Gly Asp Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg
465 470 475 480

IT 9 . Cat fctt att 9Ct gag ttt aag cgc aag cat aag aag gac 1488
Met Val Asn His Phe Ile Ala Glu Phe Lys Arg Lys His Lys Lys Asp
485 490 495

atc agt gag aac aag aga gct gta aga cgc ctc cgt act gct tgt gaa 1536
Ile Ser Glu Asn Lys Arg Ala Val Arg Arg Leu Arg Thr Ala Cys Glu
500 505 510

cgt gct aag cgt acc ctc tct tcc agc acc cag gcc agt att gag atc 1584
Arg Ala Lys Arg Thr Leu Ser Ser Ser Thr Gln Ala Ser Ile Glu Ile
515 520 525

gat tct ctc tat gaa gga atc gac ttc tat acc tcc att acc cgt gcc 1632
Asp Ser Leu Tyr Glu Gly Ile Asp Phe Tyr Thr Ser Ile Thr Arg Ala
530 535 540

cga ttt gaa gaa ctg aat gct gac ctg ttc cgt ggc acc ctg gac cca 1680
Arg Phe Glu Glu Leu Asn Ala Asp Leu Phe Arg Gly Thr Leu Asp Pro
545 550 555 560

gta gag aaa gcc ctt cga gat gcc aaa cta gac aag tca cag att cat 1728
Val Glu Lys Ala Leu Arg Asp Ala Lys Leu Asp Lys Ser Gln Ile His
565 570 575

gat att gtc ctg gtt ggt ggt tct act cgt atc ccc aag att cag aag 1776
Asp Ile Val Leu Val Gly Gly Ser Thr Arg Ile Pro Lys Ile Gln Lys
580 585 590

ctt ctc caa gac ttc ttc aat gga aaa gaa ctg aat aag agc atc aac 1824
Leu Leu Gln Asp Phe Phe Asn Gly Lys Glu Leu Asn Lys Ser Ile Asn
595 600 605

cct gat gaa gct gtt gct tat ggt gca gct gtc cag gca gcc atc ttg 1872
Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val Gln Ala Ala Ile Leu
610 615 620

tct gga gac aag tct gag aat gtt caa gat ttg ctg ctc ttg gat gtc 1920
Ser Gly Asp Lys Ser Glu Asn Val Gln Asp Leu Leu Leu Leu Asp Val
625 630 635 640

act cct ctt tcc ctt ggt att gaa act gct ggt gga gtc atg act gtc 1968
Thr Pro Leu Ser Leu Gly Ile Glu Thr Ala Gly Gly Val Met Thr Val
645 650 655

ctc atc aag cgt aat acc acc att cct acc aag cag aca cag acc ttc 2016
Leu Ile Lys Arg Asn Thr Thr Ile Pro Thr Lys Gln Thr Gln Thr Phe
660 665 670

act acc tat tct gac aac cag cct ggt gtg ctt att cag gtt tat gaa 2064
Thr Thr Tyr Ser Asp Asn Gln Pro Gly Val Leu Ile Gln Val Tyr Glu
675 680 685

ggc gag cgt gcc atg aca aag gat aac aac ctg ctt ggc aag ttt gaa 2112 
Gly Glu Arg Ala Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu 
690 695 700 

ctc aca ggc ata cct cct gca ccc cga ggt gtt cct cag att gaa gtc 2160
Leu Thr Gly Ile Pro Pro Ala Pro Arg Gly Val Pro Gln Ile Glu Val
705 710 715 720

act ttt gac att gat gcc aat ggt ata ctc aat gtc tct gct gtg gac 2208
Thr Phe Asp Ile Asp Ala Asn Gly Ile Leu Asn Val Ser Ala Val Asp
725 730 735

aag agt acg gga aaa gag aac aag att act atc act aat gac aag ggc 2256
Lys Ser Thr Gly Lys Glu Asn Lys Ile Thr Ile Thr Asn Asp Lys Gly
740 745 750

cgt ttg agc aag gaa gac att gaa cgt atg gtc cag gaa gct gag aag 2304
Arg Leu Ser Lys Glu Asp Ile Glu Arg Met Val Gln Glu Ala Glu Lys
755 760 765

tac aaa gct gaa gat gag aag cag agg gac aag gtg tca tcc aag aat 2352
Tyr Lys Ala Glu Asp Glu Lys Gln Arg Asp Lys Val Ser Ser Lys Asn
770 775 780

tca ctt gag tcc tat gcc ttc aac atg aaa gca act gtt gaa gat gag 2400
Ser Leu Glu Ser Tyr Ala Phe Asn Met Lys Ala Thr Val Glu Asp Glu
785 790 795 800

aaa ctt caa ggc aag att aac gat gag gac aaa cag aag att ctg gac 2448
Lys Leu Gln Gly Lys Ile Asn Asp Glu Asp Lys Gln Lys Ile Leu Asp
805 810 815

aag tgt aat gaa att atc aac tgg ctt gat aag aat cag act gct gag 2496
Lys Cys Asn Glu Ile Ile Asn Trp Leu Asp Lys Asn Gln Thr Ala Glu
820 825 830

aag gaa gaa ttt gaa cat caa cag aaa gag ctg gag aaa gtt tgc aac 2544
Lys Glu Glu Phe Glu His Gln Gln Lys Glu Leu Glu Lys Val Cys Asn
835 840 845

ccc atc atc acc aag ctg tac cag agt gca gga ggc atg cca gga gga 2592
Pro Ile Ile Thr Lys Leu Tyr Gln Ser Ala Gly Gly Met Pro Gly Gly
850 855 860

atg cct ggg gga ttt cct ggt ggt gga gct cct ccc tct ggt ggt gct 2640
Met Pro Gly Gly Phe Pro Gly Gly Gly Ala Pro Pro Ser Gly Gly Ala
865 870 875 880

tcc tca ggg ccc acc att gaa gag gtt gat taa g 2674
Ser Ser Gly Pro Thr Ile Glu Glu Val Asp
885 890



<210> 174 
<211> 890 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSC70 
<400> 174 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

Gly Leu Arg Ser Met Ser Lys Gly Pro Ala Val Gly Ile Asp Leu Gly
245 250 255

Thr Thr Tyr Ser Cys Val Gly Val Phe Gln His Gly Lys Val Glu Ile
260 265 270

Ile Ala Asn Asp Gln Gly Asn Arg Thr Thr Pro Ser Tyr Val Ala Phe
275 280 285

Thr Asp Thr Glu Arg Leu Ile Gly Asp Ala Ala Lys Asn Gln Val Ala
290 295 300

Met Asn Pro Thr Asn Thr Val Phe Asp Ala Lys Arg Leu Ile Gly Arg
305 310 315 320

Arg Phe Asp Asp Ala Val Val Gln Ser Asp Met Lys His Trp Pro Phe
325 330 335

Met Val Val Asn Asp Ala Gly Arg Pro Lys Val Gln Val Glu Tyr Lys
340 345 350

Gly Glu Thr Lys Ser Phe Tyr Pro Glu Glu Val Ser Ser Met Val Leu
355 360 365

Thr Lys Met Lys Glu Ile Ala Glu Ala Tyr Leu Gly Lys Thr Val Thr
370 375 380

Asn Ala Val Val Thr Val Pro Ala Tyr Phe Asn Asp Ser Gln Arg Gln
385 390 395 400

Ala Thr Lys Asp Ala Gly Thr Ile Ala Gly Leu Asn Val Leu Arg Ile
405 410 415

Ile Asn Glu Pro Thr Ala Ala Ala Ile Ala Tyr Gly Leu Asp Lys Lys
420 425 430

Val Gly Ala Glu Arg Asn Val Leu Ile Phe Asp Leu Gly Gly Gly Thr
435 440 445

Phe Asp Val Ser Ile Leu Thr Ile Glu Asp Gly Ile Phe Glu Val Lys
450 455 460

Ser Thr Ala Gly Asp Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg
465 470 475 480

Met Val Asn His Phe Ile Ala Glu Phe Lys Arg Lys His Lys Lys Asp
485 490 495

Ile Ser Glu Asn Lys Arg Ala Val Arg Arg Leu Arg Thr Ala Cys Glu
500 505 510

Arg Ala Lys Arg Thr Leu Ser Ser Ser Thr Gln Ala Ser Ile Glu Ile
515 520 525

Asp Ser Leu Tyr Glu Gly Ile Asp Phe Tyr Thr Ser Ile Thr Arg Ala
530 535 540

Arg Phe Glu Glu Leu Asn Ala Asp Leu Phe Arg Gly Thr Leu Asp Pro
545 550 555 560

Val Glu Lys Ala Leu Arg Asp Ala Lys Leu Asp Lys Ser Gln Ile His
565 570 575

Asp Ile Val Leu Val Gly Gly Ser Thr Arg Ile Pro Lys Ile Gln Lys
580 585 590

Leu Leu Gln Asp Phe Phe Asn Gly Lys Glu Leu Asn Lys Ser Ile Asn
595 600 605

Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val Gln Ala Ala Ile Leu
610 615 620

Ser Gly Asp Lys Ser Glu Asn Val Gln Asp Leu Leu Leu Leu Asp Val
625 630 635 640

Thr Pro Leu Ser Leu Gly Ile Glu Thr Ala Gly Gly Val Met Thr Val
645 650 655

Leu Ile Lys Arg Asn Thr Thr Ile Pro Thr Lys Gln Thr Gln Thr Phe
660 665 670

Thr Thr Tyr Ser Asp Asn Gln Pro Gly Val Leu Ile Gln Val Tyr Glu
675 680 685

Gly Glu Arg Ala Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu 
690 695 700 



Leu Thr Gly Ile Pro Pro Ala Pro Arg Gly Val Pro Gln Ile Glu Val
705 710 715 720

Thr Phe Asp Ile Asp Ala Asn Gly Ile Leu Asn Val Ser Ala Val Asp
725 730 735

Lys Ser Thr Gly Lys Glu Asn Lys Ile Thr Ile Thr Asn Asp Lys Gly
740 745 750

Arg Leu Ser Lys Glu Asp Ile Glu Arg Met Val Gln Glu Ala Glu Lys
755 760 765

Tyr Lys Ala Glu Asp Glu Lys Gln Arg Asp Lys Val Ser Ser Lys Asn
770 775 780

Ser Leu Glu Ser Tyr Ala Phe Asn Met Lys Ala Thr Val Glu Asp Glu
785 790 795 800

Lys Leu Gln Gly Lys Ile Asn Asp Glu Asp Lys Gln Lys Ile Leu Asp
805 810 815

Lys Cys Asn Glu Ile Ile Asn Trp Leu Asp Lys Asn Gln Thr Ala Glu
820 825 830

Lys Glu Glu Phe Glu His Gln Gln Lys Glu Leu Glu Lys Val Cys Asn
835 840 845

Pro Ile Ile Thr Lys Leu Tyr Gln Ser Ala Gly Gly Met Pro Gly Gly
850 855 860

Met Pro Gly Gly Phe Pro Gly Gly Gly Ala Pro Pro Ser Gly Gly Ala
865 870 875 880

Ser Ser Gly Pro Thr Ile Glu Glu Val Asp
885 890



<210> 175
<211> 2458
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: GFP-HSF1

<220>

<221> CDS

<222> (1)..(2349)

<400> 175 

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg acc tac ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg aag 240
Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag tcc 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

gga ctc aga tct cga gct caa gct tcg aat tct gca gtc gag atg gat 768
Gly Leu Arg Ser Arg Ala Gln Ala Ser Asn Ser Ala Val Glu Met Asp
245 250 255

ctg ccc gtg ggc ccc ggc gcg gcg ggg ccc agc aac gtc ccg gcc ttc 816
Leu Pro Val Gly Pro Gly Ala Ala Gly Pro Ser Asn Val Pro Ala Phe
260 265 270

ctg acc aag ctg tgg acc ctc gtg agc gac ccg gac acc gac gcg ctc 864
Leu Thr Lys Leu Trp Thr Leu Val Ser Asp Pro Asp Thr Asp Ala Leu
275 280 285

atc tgc tgg agc ccg agc ggg aac agc ttc cac gtg ttc gac cag ggc 912
Ile Cys Trp Ser Pro Ser Gly Asn Ser Phe His Val Phe Asp Gln Gly
290 295 300



cag ttt gcc aag gag gtg ctg ccc aag tac ttc aag cac aac aac atg 960
Gln Phe Ala Lys Glu Val Leu Pro Lys Tyr Phe Lys His Asn Asn Met
305 310 315 320

gcc agc ttc gtg cgg cag ctc aac atg tat ggc ttc cgg aaa gtg gtc 1008
Ala Ser Phe Val Arg Gln Leu Asn Met Tyr Gly Phe Arg Lys Val Val
325 330 335

cac atc gag cag ggc ggc ctg gtc aag cca gag aga gac gac acg gag 1056
His Ile Glu Gln Gly Gly Leu Val Lys Pro Glu Arg Asp Asp Thr Glu
340 345 350

ttc cag cac cca tgc ttc ctg cgt ggc cag gag cag ctc ctt gag aac 1104
Phe Gln His Pro Cys Phe Leu Arg Gly Gln Glu Gln Leu Leu Glu Asn
355 360 365

atc aag agg aaa gtg acc agt gtg tcc acc ctg aag agt gaa gac ata 1152
Ile Lys Arg Lys Val Thr Ser Val Ser Thr Leu Lys Ser Glu Asp Ile
370 375 380

aag atc cgc cag gac agc gtc acc aag ctg ctg acg gac gtg cag ctg 1200
Lys Ile Arg Gln Asp Ser Val Thr Lys Leu Leu Thr Asp Val Gln Leu
385 390 395 400

atg aag ggg aag cag gag tgc atg gac tcc aag ctc ctg gcc atg aag 1248
Met Lys Gly Lys Gln Glu Cys Met Asp Ser Lys Leu Leu Ala Met Lys
405 410 415

cat gag aat gag gct ctg tgg cgg gag gtg gcc agc ctt cgg cag aag 1296
His Glu Asn Glu Ala Leu Trp Arg Glu Val Ala Ser Leu Arg Gln Lys
420 425 430

cat gcc cag caa cag aaa gtc gtc aac aag ctc att cag ttc ctg atc 1344
His Ala Gln Gln Gln Lys Val Val Asn Lys Leu Ile Gln Phe Leu Ile
435 440 445

tca ctg gtg cag tca aac cgg atc ctg ggg gtg aag aga aag atc ccc 1392
Ser Leu Val Gln Ser Asn Arg Ile Leu Gly Val Lys Arg Lys Ile Pro
450 455 460

ctg atg ctg aac gac agt ggc tca gca cat tcc atg ccc aag tat agc 1440
Leu Met Leu Asn Asp Ser Gly Ser Ala His Ser Met Pro Lys Tyr Ser
465 470 475 480

cgg cag ttc tcc ctg gag cac gtc cac ggc tcg ggc ccc tac tcg gcc 1488
Arg Gln Phe Ser Leu Glu His Val His Gly Ser Gly Pro Tyr Ser Ala
485 490 495

ccc tcc cca gcc tac agc agc tcc agc ctc tac gcc cct gat gct gtg 1536
Pro Ser Pro Ala Tyr Ser Ser Ser Ser Leu Tyr Ala Pro Asp Ala Val
500 505 510

gcc agc tct gga ccc atc atc tcc gac atc acc gag ctg gct cct gcc 1584
Ala Ser Ser Gly Pro Ile Ile Ser Asp Ile Thr Glu Leu Ala Pro Ala
515 520 525

agc ccc atg gcc tcc ccc ggc ggg agc ata gac gag agg ccc cta tcc 1632
Ser Pro Met Ala Ser Pro Gly Gly Ser Ile Asp Glu Arg Pro Leu Ser
530 535 540

agc agc ccc ctg gtg cgt gtc aag gag gag ccc ccc agc ccg cct cag 1680
Ser Ser Pro Leu Val Arg Val Lys Glu Glu Pro Pro Ser Pro Pro Gln
545 550 555 560

agc ccc cgg gta gag gag gcg agt ccc ggg cgc cca tct tcc gtg gac 1728
Ser Pro Arg Val Glu Glu Ala Ser Pro Gly Arg Pro Ser Ser Val Asp
565 570 575

acc ctc ttg tcc ccg acc gcc ctc att gac tcc atc ctg cgg gag agt 1776
Thr Leu Leu Ser Pro Thr Ala Leu Ile Asp Ser Ile Leu Arg Glu Ser
580 585 590

gaa cct gcc ccc gcc tcc gtc aca gcc ctc acg gac gcc agg ggc cac 1824
Glu Pro Ala Pro Ala Ser Val Thr Ala Leu Thr Asp Ala Arg Gly His
595 600 605

acg gac acc gag ggc cgg cct ccc tcc ccc ccg ccc acc tcc acc cct 1872
Thr Asp Thr Glu Gly Arg Pro Pro Ser Pro Pro Pro Thr Ser Thr Pro
610 615 620

gaa aag tgc ctc agc gta gcc tgc ctg gac aag aat gag ctc agt gac 1920
Glu Lys Cys Leu Ser Val Ala Cys Leu Asp Lys Asn Glu Leu Ser Asp
625 630 635 640

cac ttg gat gct atg gac tcc aac ctg gat aac ctg cag acc atg ctg 1968
His Leu Asp Ala Met Asp Ser Asn Leu Asp Asn Leu Gln Thr Met Leu
645 650 655

agc agc cac ggc ttc agc gtg gac acc agt gcc ctg ctg gac ctg ttc 2016
Ser Ser His Gly Phe Ser Val Asp Thr Ser Ala Leu Leu Asp Leu Phe
660 665 670

agc ccc tcg gtg acc gtg ccc gac atg agc ctg cct gac ctt gac agc 2064
Ser Pro Ser Val Thr Val Pro Asp Met Ser Leu Pro Asp Leu Asp Ser
675 680 685

agc ctg gcc agt atc caa gag ctc ctg tct ccc cag gag ccc ccc agg 2112
Ser Leu Ala Ser Ile Gln Glu Leu Leu Ser Pro Gln Glu Pro Pro Arg
690 695 700

cct ccc gag gca gag aac agc agc ccg gat tca ggg aag cag ctg gtg 2160
Pro Pro Glu Ala Glu Asn Ser Ser Pro Asp Ser Gly Lys Gln Leu Val
705 710 715 720

cac tac aca gcg cag ccg ctg ttc ctg ctg gac ccc ggc tcc gtg gac 2208
His Tyr Thr Ala Gln Pro Leu Phe Leu Leu Asp Pro Gly Ser Val Asp
725 730 735

acc ggg agc aac gac ctg ccg gtg ctg ttt gag ctg gga gag ggc tcc 2256
Thr Gly Ser Asn Asp Leu Pro Val Leu Phe Glu Leu Gly Glu Gly Ser
740 745 750

tac ttc tcc gaa ggg gac ggc ttc gcc gag gac ccc acc atc tcc ctg 2304
Tyr Phe Ser Glu Gly Asp Gly Phe Ala Glu Asp Pro Thr Ile Ser Leu
755 760 765

ctg aca ggc tcg gag cct ccc aaa gcc aag gac ccc act gtc tcc 2349
Leu Thr Gly Ser Glu Pro Pro Lys Ala Lys Asp Pro Thr Val Ser
770 775 780

tagaggcccc ggaggagctg ggccagccgc ccacccccac ccccagtgca gggctggtct 2409
tggggaggca gggcagcctc gcggtcttgg gcactggtgg gtcggccgg 2458

<210> 176 
<211> 783 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSF1 
<400> 176 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

Gly Leu Arg Ser Arg Ala Gln Ala Ser Asn Ser Ala Val Glu Met Asp
245 250 255

Leu Pro Val Gly Pro Gly Ala Ala Gly Pro Ser Asn Val Pro Ala Phe
260 265 270

Leu Thr Lys Leu Trp Thr Leu Val Ser Asp Pro Asp Thr Asp Ala Leu
275 280 285

Ile Cys Trp Ser Pro Ser Gly Asn Ser Phe His Val Phe Asp Gln Gly
290 295 300

Gln Phe Ala Lys Glu Val Leu Pro Lys Tyr Phe Lys His Asn Asn Met
305 310 315 320

Ala Ser Phe Val Arg Gln Leu Asn Met Tyr Gly Phe Arg Lys Val Val
325 330 335

His Ile Glu Gln Gly Gly Leu Val Lys Pro Glu Arg Asp Asp Thr Glu
340 345 350

Phe Gln His Pro Cys Phe Leu Arg Gly Gln Glu Gln Leu Leu Glu Asn
355 360 365

Ile Lys Arg Lys Val Thr Ser Val Ser Thr Leu Lys Ser Glu Asp Ile
370 375 380

Lys Ile Arg Gln Asp Ser Val Thr Lys Leu Leu Thr Asp Val Gln Leu
385 390 395 400

Met Lys Gly Lys Gln Glu Cys Met Asp Ser Lys Leu Leu Ala Met Lys
405 410 415

His Glu Asn Glu Ala Leu Trp Arg Glu Val Ala Ser Leu Arg Gln Lys
420 425 430

His Ala Gln Gln Gln Lys Val Val Asn Lys Leu Ile Gln Phe Leu Ile
435 440 445

Ser Leu Val Gln Ser Asn Arg Ile Leu Gly Val Lys Arg Lys Ile Pro
450 455 460

Leu Met Leu Asn Asp Ser Gly Ser Ala His Ser Met Pro Lys Tyr Ser
465 470 475 480

Arg Gln Phe Ser Leu Glu His Val His Gly Ser Gly Pro Tyr Ser Ala
485 490 495

Pro Ser Pro Ala Tyr Ser Ser Ser Ser Leu Tyr Ala Pro Asp Ala Val
500 505 510

Ala Ser Ser Gly Pro Ile Ile Ser Asp Ile Thr Glu Leu Ala Pro Ala
515 520 525

Ser Pro Met Ala Ser Pro Gly Gly Ser Ile Asp Glu Arg Pro Leu Ser
530 535 540

Ser Ser Pro Leu Val Arg Val Lys Glu Glu Pro Pro Ser Pro Pro Gln
545 550 555 560

Ser Pro Arg Val Glu Glu Ala Ser Pro Gly Arg Pro Ser Ser Val Asp
565 570 575

Thr Leu Leu Ser Pro Thr Ala Leu Ile Asp Ser Ile Leu Arg Glu Ser
580 585 590

Glu Pro Ala Pro Ala Ser Val Thr Ala Leu Thr Asp Ala Arg Gly His
595 600 605

Thr Asp Thr Glu Gly Arg Pro Pro Ser Pro Pro Pro Thr Ser Thr Pro
610 615 620

Glu Lys Cys Leu Ser Val Ala Cys Leu Asp Lys Asn Glu Leu Ser Asp
625 630 635 640

His Leu Asp Ala Met Asp Ser Asn Leu Asp Asn Leu Gln Thr Met Leu
645 650 655

Ser Ser His Gly Phe Ser Val Asp Thr Ser Ala Leu Leu Asp Leu Phe
660 665 670

Ser Pro Ser Val Thr Val Pro Asp Met Ser Leu Pro Asp Leu Asp Ser
675 680 685

Ser Leu Ala Ser Ile Gln Glu Leu Leu Ser Pro Gln Glu Pro Pro Arg
690 695 700

Pro Pro Glu Ala Glu Asn Ser Ser Pro Asp Ser Gly Lys Gln Leu Val
705 710 715 720

His Tyr Thr Ala Gln Pro Leu Phe Leu Leu Asp Pro Gly Ser Val Asp
725 730 735

Thr Gly Ser Asn Asp Leu Pro Val Leu Phe Glu Leu Gly Glu Gly Ser
740 745 750

Tyr Phe Ser Glu Gly Asp Gly Phe Ala Glu Asp Pro Thr Ile Ser Leu
755 760 765

Leu Thr Gly Ser Glu Pro Pro Lys Ala Lys Asp Pro Thr Val Ser
770 775 780



<210> 177
<211> 2416
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: GFP-NFKB

<220>

<221> CDS

<222> (1)..(2415)

<400> 177 

atg gtg agc aag ggc gag gag ctg ttc acc ggg gtg gtg ccc atc ctg 48
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc agc gtg tcc ggc 96
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc atc 144
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc ctc gtg acc acc 192
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

ctg acc tac ggc gtg cag tgc ttc agc cgc tac ccc gac cac atg aag 240
Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

cag cac gac ttc ttc aag tcc gcc atg ccc gaa ggc tac gtc cag gag 288
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

cgc acc atc ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

gtg aag ttc gag ggc gac acc ctg gtg aac cgc atc gag ctg aag ggc 384
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

atc gac ttc aag gag gac ggc aac atc ctg ggg cac aag ctg gag tac 432
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

aac tac aac agc cac aac gtc tat atc atg gcc gac aag cag aag aac 480
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

ggc atc aag gtg aac ttc aag atc cgc cac aac atc gag gac ggc agc 528
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

gtg cag ctc gcc gac cac tac cag cag aac acc ccc atc ggc gac ggc 576
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190

ccc gtg ctg ctg ccc gac aac cac tac ctg agc acc cag tcc gcc ctg 624
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

agc aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

gtg acc gcc gcc ggg atc act ctc ggc atg gac gag ctg tac aag tcc 720
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

gga ctc aga tct cga gat ccg ccc ttc atg gac gaa ctg ttc ccc ctc 768
Gly Leu Arg Ser Arg Asp Pro Pro Phe Met Asp Glu Leu Phe Pro Leu
245 250 255

atc ttc ccg gca gag cca gcc cag gcc tct ggc ccc tat gtg gag atc 816
Ile Phe Pro Ala Glu Pro Ala Gln Ala Ser Gly Pro Tyr Val Glu Ile
260 265 270

att gag cag ccc aag cag cgg ggc atg cgc ttc cgc tac aag tgc gag 864
Ile Glu Gln Pro Lys Gln Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu
275 280 285

ggg cgc tcc gcg ggc agc atc cca ggc gag agg agc aca gat acc acc 912
Gly Arg Ser Ala Gly Ser Ile Pro Gly Glu Arg Ser Thr Asp Thr Thr
290 295 300

aag acc cac ccc acc atc aag atc aat ggc tac aca gga cca ggg aca 960
Lys Thr His Pro Thr Ile Lys Ile Asn Gly Tyr Thr Gly Pro Gly Thr
305 310 315 320

gtg cgc atc tcc ctg gtc acc aag gac cct cct cac cgg cct cac ccc 1008
Val Arg Ile Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro
325 330 335



cac gag ctt gta gga aag gac tgc cgg gat ggc ttc tat gag gct gag 1056
His Glu Leu Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu
340 345 350

ctc tgc ccg gac cgc tgc atc cac agt ttc cag aac ctg gga atc cag 1104
Leu Cys Pro Asp Arg Cys Ile His Ser Phe Gln Asn Leu Gly Ile Gln
355 360 365

tgt gtg aag aag cgg gac ctg gag cag gct atc agt cag cgc atc cag 1152
Cys Val Lys Lys Arg Asp Leu Glu Gln Ala Ile Ser Gln Arg Ile Gln
370 375 380



acc aac aac aac ccc ttc caa gtt cct ata gaa gag cag cgt ggg gac 1200
Thr Asn Asn Asn Pro Phe Gln Val Pro Ile Glu Glu Gln Arg Gly Asp
385 390 395 400

tac gac ctg aat gct gtg cgg ctc tgc ttc cag gtg aca gtg cgg gac 1248
Tyr Asp Leu Asn Ala Val Arg Leu Cys Phe Gln Val Thr Val Arg Asp
405 410 415

cca tca ggc agg ccc ctc cgc ctg ccg cct gtc ctt tct cat ccc atc 1296
Pro Ser Gly Arg Pro Leu Arg Leu Pro Pro Val Leu Ser His Pro Ile
420 425 430

ttt gac aat cgt gcc ccc aac act gcc gag ctc aag atc tgc cga gtg 1344
Phe Asp Asn Arg Ala Pro Asn Thr Ala Glu Leu Lys Ile Cys Arg Val
435 440 445

aac cga aac tct ggc agc tgc ctc ggt ggg gat gag atc ttc cta ctg 1392
Asn Arg Asn Ser Gly Ser Cys Leu Gly Gly Asp Glu Ile Phe Leu Leu
450 455 460



tgt gac aag gtg cag aaa gag gac att gag gtg tat ttc acg gga cca 1440
Cys Asp Lys Val Gln Lys Glu Asp Ile Glu Val Tyr Phe Thr Gly Pro
465 470 475 480

ggc tgg gag gcc cga ggc tcc ttt tcg caa gct gat gtg cac cga caa 1488
Gly Trp Glu Ala Arg Gly Ser Phe Ser Gln Ala Asp Val His Arg Gln
485 490 495

gtg gcc att gtg ttc cgg acc cct ccc tac gca gac ccc agc ctg cag 1536
Val Ala Ile Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gln
500 505 510

gct cct gtg cgt gtc tcc atg cag ctg cgg cgg cct tcc gac cgg gag 1584
Ala Pro Val Arg Val Ser Met Gln Leu Arg Arg Pro Ser Asp Arg Glu
515 520 525

ctc agt gag ccc atg gaa ttc cag tac ctg cca gat aca gac gat cgt 1632
Leu Ser Glu Pro Met Glu Phe Gln Tyr Leu Pro Asp Thr Asp Asp Arg
530 535 540

cac cgg att gag gag aaa cgt aaa agg aca tat gag acc ttc aag agc 1680
His Arg Ile Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser
545 550 555 560

atc atg aag aag agt cct ttc agc gga ccc acc gac ccc cgg cct cca 1728
Ile Met Lys Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro
565 570 575

cct cga cgc att gct gtg cct tcc cgc agc tca gct tct gtc ccc aag 1776
Pro Arg Arg Ile Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys
580 585 590

cca gca ccc cag ccc tat ccc ttt acg tca tcc ctg agc acc atc aac 1824
Pro Ala Pro Gln Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr Ile Asn
595 600 605

tat gat gag ttt ccc acc atg gtg ttt cct tct ggg cag atc agc cag 1872
Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gln Ile Ser Gln
610 615 620



gcc tcg gcc ttg gcc ccg gcc cct ccc caa gtc ctg ccc cag gct cca 1920
Ala Ser Ala Leu Ala Pro Ala Pro Pro Gln Val Leu Pro Gln Ala Pro
625 630 635 640



gcc cct gcc cct gct cca gcc atg gta tca gct ctg gcc cag gcc cca 1968
Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gln Ala Pro
645 650 655

gcc cct gtc cca gtc cta gcc cca ggc cct cct cag gct gtg gcc cca 2016
Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gln Ala Val Ala Pro
660 665 670

cct gcc ccc aag ccc acc cag gct ggg gaa gga acg ctg tca gag gcc 2064
Pro Ala Pro Lys Pro Thr Gln Ala Gly Glu Gly Thr Leu Ser Glu Ala
675 680 685

ctg ctg cag ctg cag ttt gat gat gaa gac ctg ggg gcc ttg ctt ggc 2112
Leu Leu Gln Leu Gln Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly
690 695 700

aac agc aca gac cca gct gtg ttc aca gac ctg gca tcc gtc gac aac 2160
Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn
705 710 715 720

tcc gag ttt cag cag ctg ctg aac cag ggc ata cct gtg gcc ccc cac 2208
Ser Glu Phe Gln Gln Leu Leu Asn Gln Gly Ile Pro Val Ala Pro His
725 730 735

aca act gag ccc atg ctg atg gag tac cct gag gct ata act cgc cta 2256
Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala Ile Thr Arg Leu
740 745 750

gtg aca gcc cag agg ccc ccc gac cca gct cct gct cca ctg ggg gcc 2304
Val Thr Ala Gln Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala
755 760 765

ccg ggg ctc ccc aat ggc ctc ctt tca gga gat gaa gac ttc tcc tcc 2352
Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser
770 775 780

att gcg gac atg gac ttc tca gcc ctg ctg agt cag atc agc tcc aag 2400
Ile Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gln Ile Ser Ser Lys
785 790 795 800

ggc gaa ttc gaa gct t 2416
Gly Glu Phe Glu Ala
805



<210> 178 
<211> 805 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-NFKB 
<400> 178 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu
1 5 10 15

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly
20 25 30

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile
35 40 45

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr
50 55 60

Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys
65 70 75 80

Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu
85 90 95

Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu
100 105 110

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly
115 120 125

Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr
130 135 140

Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn
145 150 155 160

Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser
165 170 175

Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly
180 185 190 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu
195 200 205

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe
210 215 220

Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser
225 230 235 240

Gly Leu Arg Ser Arg Asp Pro Pro Phe Met Asp Glu Leu Phe Pro Leu
245 250 255

Ile Phe Pro Ala Glu Pro Ala Gln Ala Ser Gly Pro Tyr Val Glu Ile
260 265 270

Ile Glu Gln Pro Lys Gln Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu
275 280 285

Gly Arg Ser Ala Gly Ser Ile Pro Gly Glu Arg Ser Thr Asp Thr Thr
290 295 300

Lys Thr His Pro Thr Ile Lys Ile Asn Gly Tyr Thr Gly Pro Gly Thr
305 310 315 320

Val Arg Ile Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro
325 330 335

His Glu Leu Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu
340 345 350

Leu Cys Pro Asp Arg Cys Ile His Ser Phe Gln Asn Leu Gly Ile Gln
355 360 365

Cys Val Lys Lys Arg Asp Leu Glu Gln Ala Ile Ser Gln Arg Ile Gln
370 375 380

Thr Asn Asn Asn Pro Phe Gln Val Pro Ile Glu Glu Gln Arg Gly Asp
385 390 395 400

Tyr Asp Leu Asn Ala Val Arg Leu Cys Phe Gln Val Thr Val Arg Asp
405 410 415

Pro Ser Gly Arg Pro Leu Arg Leu Pro Pro Val Leu Ser His Pro Ile
420 425 430

Phe Asp Asn Arg Ala Pro Asn Thr Ala Glu Leu Lys Ile Cys Arg Val
435 440 445

Asn Arg Asn Ser Gly Ser Cys Leu Gly Gly Asp Glu Ile Phe Leu Leu
450 455 460

Cys Asp Lys Val Gln Lys Glu Asp Ile Glu Val Tyr Phe Thr Gly Pro
465 470 475 480

Gly Trp Glu Ala Arg Gly Ser Phe Ser Gln Ala Asp Val His Arg Gln
485 490 495 
Val Ala Ile Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gln
500 505 510

Ala Pro Val Arg Val Ser Met Gln Leu Arg Arg Pro Ser Asp Arg Glu
515 520 525

Leu Ser Glu Pro Met Glu Phe Gln Tyr Leu Pro Asp Thr Asp Asp Arg
530 535 540

His Arg Ile Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser
545 550 555 560

Ile Met Lys Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro
565 570 575

Pro Arg Arg Ile Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys
580 585 590

Pro Ala Pro Gln Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr Ile Asn
595 600 605

Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gln Ile Ser Gln
610 615 620

Ala Ser Ala Leu Ala Pro Ala Pro Pro Gln Val Leu Pro Gln Ala Pro
625 630 635 640

Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gln Ala Pro
645 650 655

Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gln Ala Val Ala Pro
660 665 670

Pro Ala Pro Lys Pro Thr Gln Ala Gly Glu Gly Thr Leu Ser Glu Ala
675 680 685

Leu Leu Gln Leu Gln Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly
690 695 700

Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn
705 710 715 720

Ser Glu Phe Gln Gln Leu Leu Asn Gln Gly Ile Pro Val Ala Pro His
725 730 735

Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala Ile Thr Arg Leu
740 745 750

Val Thr Ala Gln Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala
755 760 765

Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser
770 775 780

Ile Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gln Ile Ser Ser Lys
785 790 795 800

Gly Glu Phe Glu Ala
805



<210> 179 

<211> 1677 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-IKB 

<220> 

<221> CDS 

<222> (1)..(1674)

<400> 179 

atg ttc cag gcg gct gag cgc ccc cag gag tgg gcc atg gag ggc ccc 48
Met Phe Gln Ala Ala Glu Arg Pro Gln Glu Trp Ala Met Glu Gly Pro
1 5 10 15

cgc gac ggg ctg aag aag gag cgg cta ctg gac gac cgc cac gac agc 96
Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser
20 25 30

ggc ctg gac tcc atg aaa gac gag gag tac gag cag atg gtc aag gag 144
Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gln Met Val Lys Glu
35 40 45

ctg cag gag atc cgc ctc gag ccg cag gag gtg ccg cgc ggc tcg gag 192
Leu Gln Glu Ile Arg Leu Glu Pro Gln Glu Val Pro Arg Gly Ser Glu
50 55 60

ccc tgg aag cag cag ctc acc gag gac ggg gac tcg ttc ctg cac ttg 240
Pro Trp Lys Gln Gln Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu
65 70 75 80

gcc atc atc cat gaa gaa aag gca ctg acc atg gaa gtg atc cgc cag 288
Ala Ile Ile His Glu Glu Lys Ala Leu Thr Met Glu Val Ile Arg Gln
85 90 95

gtg aag gga gac ctg gcc ttc ctc aac ctc cag aac aac ctg cag cag 336
Val Lys Gly Asp Leu Ala Phe Leu Asn Leu Gln Asn Asn Leu Gln Gln
100 105 110

act cca ctc cac ttg gct gtg atc acc aac cag cca gaa att gct gag 384
Thr Pro Leu His Leu Ala Val Ile Thr Asn Gln Pro Glu Ile Ala Glu
115 120 125

gca ctt ctg gga gct ggc tgt gat cct gag ctc cga gac ttt cga gga 432
Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly
130 135 140

aat acc ccc cta cac ctt gcc tgt gag cag ggc tgc ctg gcc agc gtg 480
Asn Thr Pro Leu His Leu Ala Cys Glu Gln Gly Cys Leu Ala Ser Val
145 150 155 160
gga gtc ctg act cag tcc tgc acc acc ccg cac ctc cac tcc atc ttg 528
Gly Val Leu Thr Gln Ser Cys Thr Thr Pro His Leu His Ser Ile Leu
165 170 175

aag gct acc aac tac aat ggc cac acg tgt cta cac tta gcc tct atc 576
Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser Ile
180 185 190

cat ggc tac ctg ggc atc gtg gag ctt ttg gtg tcc ttg ggt gct gat 624
His Gly Tyr Leu Gly Ile Val Glu Leu Leu Val Ser Leu Gly Ala Asp
195 200 205

gtc aat gct cag gag ccc tgt aat ggc cgg act gcc ctt cac ctc gca 672
Val Asn Ala Gln Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala
210 215 220

gtg gac ctg caa aat cct gac ctg gtg tca ctc ctg ttg aag tgt ggg 720
Val Asp Leu Gln Asn Pro Asp Leu Val Ser Leu Leu Leu Lys Cys Gly
225 230 235 240

gct gat gtc aac aga gtt acc tac cag ggc tat tct ccc tac cag ctc 768
Ala Asp Val Asn Arg Val Thr Tyr Gln Gly Tyr Ser Pro Tyr Gln Leu
245 250 255

acc tgg ggc cgc cca agc acc cgg ata cag cag cag ctg ggc cag ctg 816
Thr Trp Gly Arg Pro Ser Thr Arg Ile Gln Gln Gln Leu Gly Gln Leu
260 265 270

aca cta gaa aac ctt cag atg ctg cca gag agt gag gat gag gag agc 864
Thr Leu Glu Asn Leu Gln Met Leu Pro Glu Ser Glu Asp Glu Glu Ser
275 280 285

tat gac aca gag tca gag ttc acg gag ttc aca gag gac gag ctg ccc 912
Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe Thr Glu Asp Glu Leu Pro
290 295 300

tat gat gac tgt gtg ttt gga ggc cag cgt ctg acg tta acc ggt atg 960
Tyr Asp Asp Cys Val Phe Gly Gly Gln Arg Leu Thr Leu Thr Gly Met
305 310 315 320

gct agc aaa gga gaa gaa ctc ttc act gga gtt gtc cca att ctt gtt 1008
Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
325 330 335

gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga gag 1056
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
340 345 350

ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc atc tgc 1104
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
355 360 365

act act ggc aaa ctg cct gtt cca tgg cca aca cta gtc act act ctg 1152
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
370 375 380

tgc tat ggt gtt caa tgc ttt tca aga tac ccg gat cat atg aaa cgg 1200
Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
385 390 395 400

cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa agg 1248
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
405 410 415

acc atc ttc ttc aaa gat gac ggc aac tac aag aca cgt gct gaa gtc 1296
Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
420 425 430

aag ttt gaa ggt gat acc ctt gtt aat aga atc gag tta aaa ggt att 1344
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
435 440 445

gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac aac 1392
Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
450 455 460

tat aac tca cac aat gta tac atc atg gca gac aaa caa aag aat gga 1440
Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
465 470 475 480

atc aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga agc gtt 1488
Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser Val
485 490 495

caa cta gca gac cat tat caa caa aat act cca att ggc gat ggc cct 1536
Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
500 505 510

gtc ctt tta cca gac aac cat tac ctg tcc aca caa tct gcc ctt tcg 1584
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
515 520 525

aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt gta 1632
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
530 535 540

aca gct gct ggg att aca cat ggc atg gat gaa ctg tac aac tag 1677
Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn
545 550 555



<210> 180 
<211> 558 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-IKB 
<400> 180 

Met Phe Gln Ala Ala Glu Arg Pro Gln Glu Trp Ala Met Glu Gly Pro
1 5 10 15

Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser
20 25 30

Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gln Met Val Lys Glu
35 40 45

Leu Gln Glu Ile Arg Leu Glu Pro Gln Glu Val Pro Arg Gly Ser Glu
50 55 60

Pro Trp Lys Gln Gln Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu
65 70 75 80

Ala Ile Ile His Glu Glu Lys Ala Leu Thr Met Glu Val Ile Arg Gln
85 90 95

Val Lys Gly Asp Leu Ala Phe Leu Asn Leu Gln Asn Asn Leu Gln Gln
100 105 110

Thr Pro Leu His Leu Ala Val Ile Thr Asn Gln Pro Glu Ile Ala Glu
115 120 125

Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly
130 135 140

Asn Thr Pro Leu His Leu Ala Cys Glu Gln Gly Cys Leu Ala Ser Val
145 150 155 160

Gly Val Leu Thr Gln Ser Cys Thr Thr Pro His Leu His Ser Ile Leu
165 170 175

Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser Ile
180 185 190

His Gly Tyr Leu Gly Ile Val Glu Leu Leu Val Ser Leu Gly Ala Asp
195 200 205

Val Asn Ala Gln Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala
210 215 220

Val Asp Leu Gln Asn Pro Asp Leu Val Ser Leu Leu Leu Lys Cys Gly
225 230 235 240

Ala Asp Val Asn Arg Val Thr Tyr Gln Gly Tyr Ser Pro Tyr Gln Leu
245 250 255

Thr Trp Gly Arg Pro Ser Thr Arg Ile Gln Gln Gln Leu Gly Gln Leu
260 265 270

Thr Leu Glu Asn Leu Gln Met Leu Pro Glu Ser Glu Asp Glu Glu Ser
275 280 285

Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe Thr Glu Asp Glu Leu Pro
290 295 300

Tyr Asp Asp Cys Val Phe Gly Gly Gln Arg Leu Thr Leu Thr Gly Met
305 310 315 320

Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val
325 330 335



Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu
340 345 350

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys
355 360 365

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu
370 375 380

Cys Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg
385 390 395 400

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg
405 410 415

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val
420 425 430

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile
435 440 445

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn
450 455 460



Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
465 470 475 480

Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser Val
485 490 495

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro
500 505 510

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser
515 520 525

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val
530 535 540

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Asn
545 550 555